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AND LEARNING ABILITY: 


A COMPARATIVE-EXPERIMENTAL APPROACH 


GILBERT W. MEIER 
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CARSON Y. NOLAN | 
American Printing House for the Blind 


ESPITE THE fact that it has been 
strongly suspected for nearly a cen- 

tury (Little, 1861) that severe anoxia dur- 
ing early development may be responsible 
for certain neurologic disorders and mental 
retardation in children who would otherwise 
be normal, the problem of determining the 
actual contribution of the anoxia to the 
abnormality of behavior has remained an 
exceedingly difficult one. Relatively recent 
attempts to study the question by controlled 
animal experimentation in the laboratory 
have had the advantages of more accurate 
knowledge and management of the oxygen 
deficiency to which the animals are exposed, 
but they have encountered the difficult ques- 
tion of the measurement of psychological 
Processes which may be adversely affected. 
The researches discussed in the following 
monograph have tried to evaluate the effect 
Of oxygen deficiency prior to birth and at 
birth. "They are the culmination of an or- 
ganized program initiated in the Depart- 
ment of Psychology at Washington Univer- 
sity Which has been generously supported 
by grants from the National Institute of 
Mental Health, United States Public Health 
Service (M-362 and M-762), and the 
United Cerebral Palsy, Incorporated (PL- 
45). Essentially, these studies have shown 
that a physiological insult, such as anoxia, 
incurred during early development can have 
an effect on psychological processes, at least 
those involved in problem solving behavior. 
The mechanisms by which these changes 
are effected have not been elucidated. This 
elucidation is the purpose of further re- 
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search which, at this time at least, points to 
а connection with the investigations on the 
effect of early experience. 


The studies reported upon here, along 
with those which have followed the program 
initiated at Washington University (Meier, 
in press; Meier & Menzel, 1955), illustrate 
а point of view which the authors (among 
others) feel needs re-emphasis at this time. 
Within the range of species of animal life 
adaptable to laboratory conditions, the simi- 
liagties between them should prove more 
important than the differences. Certainly 
this is true for the understanding of be- 
havior rather than behaviors, which are the 
unique presentations of a particular species. 

Clearly, this idea is not new. In those 
other life sciences in which systems or parts 
of organisms are of prime interest, e.g., 
ecology, morphology, and physiology, this 
truism has been utilized, as well as realized, 
for some time. For psychology, the usage 
of a wide range of species offers obstacles 
of greater proportion than those apparent 
to the other sciences. Where the time of 
laboratory maintenance is long, which is the 
usual case in behavioral research, considera- 
tions Of size, reproductivity, diet, physio- 
logical rhythms, and tractability must be 
made. In addition, as we have had to do 
here, the investigator may have to consider 
the methodology of testing behavior, includ- 
ing the design and complexity of problems 
available (or which could be feasibly 
offered), the nature of the growth curve, 
and the relative developmental stage at 
birth. All of these appear to be outside the 
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Scope of the problem of immediate experi- 
mental interest even though the extent to 
which each or all of them are evaluated will, 
to some degree, determine the manner of 
evaluation and extent of generalization of 
the results and conclusions of the research. 

As stated, the impetus for the original 
researches cited here were the clinical prob- 
lems centering upon the lack of oxygen 
during early development. For these investi- 
gations, the cat and rat (and, more recently, 
the chick) were chosen as experimental 
media. These choices were dictated by the 
particular phase of the problem studied and 
the characteristics of the species. ( With the 
recent studies involving the chick, the un- 
usual technical expedience for prenatal de- 
Privation was the major consideration.) 
We feel that these investigations involving 
а multispecies approach not only demon- 
strate the operational practicality of the ap- 
proach itself, but make the results more 
tenable, generally, and indicative of be- 
havior changes which are the outcome of a 
disruption of a Sensitive metabolic balance 


in periods of maximal cellular differentia- 
tion and growth, 


SOME ASPECTS OF THE PHYSIOLOGICAL 
MECHANISMS IN ANOXIA 


Concern us here, 


The premature organism, in these studies 
the fetus and the neonate, is characterized 


by certain metabolic feature 


1A fuller discussion 
maturation and metab 
wich's Brain Metab 
(1951). 


of the relationships between 
olism will be found in Him- 


olism and Cerebral Disorders 


vived an atmosphere of pure nitrogen as. 
long as 44 min. Under similar circum- 
stances, the neonate survived 21 пи 
whereas the 19 day old infant survived aoi 
12 min. (Glass, Snyder, & Webster, 19 otf 
Depriving chicken embryos and gren 
oxygen, Meier (unpublished data) | din 
similar relationships. At the end of 1 97, 


ав. 210 
the median lethal dose (ТО) pus 65] 
min.; at 4 days, 105 min.; at 8 days 35. 


min.; at 12 days, 43 min.; at 16 сау on 
min.; and at 19 days, 25 min. In bo iom 
searches, the ability of the young organ bly” 
to survive the deficiency was consid ae 
greater than that of the old. The pon func- 
gether suggest a negative exponentia with 
tion; the ability decreases sharply W" P. 
maturation. А іп. 
The explanation of these айе | 
tolerance to an oxygen lack in the ii ro- |, 
ment lies in the nature of the energy ee 
duction at the various stages in dev ismi 
ment. In the early embryo the metabo е- 
is entirely anaerobic; in the adult, ie eo 
tabolism is largely aerobic. This P овса! | 
capitulation of phylogenesis in physi? lexity | 
terms in that with the increasing со 2 
of structure and function there а „ation. | 
increasing efficiency of energy moby ts has | 
With reference to metabolism t erobic | 
meant a change from anaerobic tO stion 0 5 
utilization of nutrients in the produ welve 
energy; this offers the organism Himwich 
fold increase in energy output ( | 
1951, p. 143). - survive” 
Second, the limit on organismic the. 
bility to oxygen lack is set, in part e in | 
peculiarities of the nervous system. e 
creasingly greater role of автор. 
olism during development descri п, 
system тоге aptly than any other "jsm's | 
body. Although at no time in the он апаег | 
life is this system completely witho"! erobi€ 
obic metabolism, the dominance © specially | 
over anaerobic increases rapidly, < 
in the period immediately after á 
arrival at the adult metabolic statt? at 4 
on the species and its characteris che 
maturation. In the rat this level di a 
at 21 days after birth ; in the dog de lap$ 
ble functioning occurs following Fazekas 
of 35 to 40 days (Himwich & 
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1941). (For this comparison it is assumed 
that the dog is no more immature at birth 
than is the rat.) E 

It follows from the foregoing that pri- 
mary consideration should be given to the 
physiological mechanisms of the nervous 
system. This should not be construed to 
be a theoretical bias on the part of the 
authors with regard to the importance oi 
this part of the organism in the totality of 
its behavior. Insofar as oxygen and the 
physiology of the body is concerned, the 
nervous system is the most readily affected, 
in both the young and the old. When it 
ceases to function under abnormal condi- 
tions of availability of. oxygen, eventually, 
so does the remainder of the organism. 

Third, in addition to the uniqueness of 
the metabolic processes of the nervous sys- 
tem as related to the survivability of the 
organism, a further fact must be recalled : 
there are certain delaying actions on the 
resistance of the adult to physiological in- 
sult. This is the fact of the irreversibility 
of the developed nervous system once the 
proliferation of the neuronal cells ceases; 
shortly after birth. Although these cells may 
continue to grow in size or in the length o 
their fibrous processes, they do not multiply. 
Although there occurs a regeneration of 
parts of the individual cells, regeneration 
of parts of the system is absent. In terms 
of the viability and behavior of the organ- 
ism, this means that in the older individual 
the debilitation following oxygen depriva- 
tion (or some other such insult ) will be 
more extensive and less transitory. Further, 
artificial aids in the sustenance of life (as 
some sort of technique for inducing and 
continuing respiratory activity) will be 
effective only so long as they are in opera- 
tion ; expectations that the organism will re- 
gain the facilities to carry on these vital 
functions are futile. Lastly, death may fol- 
low the deprivation after an appreciable 
lapse of time because of the inability of the 
mature system to replace the integral cells 
which are now fully nonfunctional and de- 
generated. 

Finally, besides the transition in the rela- 
tive roles of the two forms of metabolism, 
an equally apparent transition is evidenced 


by the physiological and behavioral impor- 
tance given to specific portions of the nerv- 
ous system at particular phases in develop- 
ment. In terms of the individual's develop- 
ment, the importance of each of the portions 
follows closely the patterns seen in phyletic 
growth. As indicated by the rate of differ- 
entiation and growth of the constituent 
cells, the level of oxygen consumption, and 
the rate of integrated function, the develop- 
mental hierarchy proceeds from the lowest 
centers in the medulla and cord to the high- 
est in the diencephalon and cerebrum. The 
converse of this is depicted by the adult 
undergoing oxygen derivation (by drugs, 
simulated high altitudes, etc.). The cortical 
functions are the first to cease; the medul- 
lary and spinal are the last. 

For the birds and mammals the climax 
in the developmental hierarchy is not 
reached until after birth. Conversely, in the 
neonate, centers other than those in the 
cerebrum are functioning maximally ; these 
are the midbrain, pons, and medulla. As in- 
dicated by electroencephalographic activity, 
a crude measure at best, the first appearance 
of fully integrated neural activity is in the 
last third or last quarter of gestation (Jas- 
per, Bridgman, & Carmichael, 1937). At 
parturition, the activity is still slight, with 
irregular, slow frequencies and low ampli- 
tudes. In most forms the adult pattern is 
reached considerably later (in the human 
about 10-12 yrs.) (Charles & Fuller, 1956; 
Е. A. Gibbs & Е. 1. Gibbs, 1950; Libet, 
Fazekas, & Himwich, 1941 Vis 

So far, the discussion has been upon the 
descriptive stages in early functional devel- 
opment. Each of these stages encompasses 
but a brief temporal interval. In such longi- 
tudinal researches, as in one reported in this 
series, these stages, individually, are of un- 
known significance as determined by the 
data at hand. That is to say, to date there 
is insufficient evidence by which one can 
competently describe the relation between 
the time of the physiological insult and the 
particular structures undergoing principal 
development or the sequelea in the realm of 
behavior dysfunctions. Possibly, there 1$ а 
family of curves representing each of the 
structures, generally alike but each with а 
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different time scale. Each curve rises to a 
maximum at the period of greatest sensi- 
tivity to the insult, then maintains that high 
point (Levinson, 1952) or, for less com- 
plex systems, slowly falls (Ingalls, 1947a, 
1947b; Meier, 1953). Physiologically, this 
is a description of the possible bases of the 
behavioral phenomena of "critical period" 
and "psychological irreversibility." 

The relation between developmental age 
(as determined by the level of oxygen con- 
sumption) and the severity of effect shows 
no sign of confounding with specific mor- 
phological or functional deviations which 
could denote impairments in localized struc- 
tures or Systems. This poses an interesting, 
though still unresolved, problem. Through 
the stages of fetal and postnatal develop- 
ment, the center making the greatest metab- 
ойс demands шау not be among those 
centers most adversely affected. Acute al- 
ternations in the young organism are not 
necessarily followed by appropriate chronic, 
lasting deviations. 

Correlated behavioral-histological studies 
have born out this Observation. Windle, 
Becker, and Weil (1944) have reported 
permanent histological sequelae to natal 
asphyxia as varied, with localized and dif- 
fuse degeneration. No relation with par- 
с, rre Structures was apparent. 
Basics EA ea ioe gad кааш. be- 
aid Ve rea These animals 

аа showed marked sensory— 
motor dysfunctions (and asphyxia) for the 
first few days followin birth. TI in- 
cluded tremors, spasti E dde ee 

» Spasticity, rigidity, somno- 
lence, and paralysis. For the most part 
these disappeared within а few days re 
upon the animal assumed the air ‘of nor- 
malcy. ( The histopathology immediately 
after deprivation, that 1s, within one to five 
days after birth, consisted of, largely, mul- 
tiple, diffuse capillary hemorrhages.) 

Observing only behavior changes, Meier 
and Menzel (1955) reported that certain 
changes in behavior did follow deprivation 
at specific periods in fetal growth. How- 

ever, rather than use the amount of oxygen 
metabolized as an index for specified peri- 
ods of development, they used growth 


schedules based upon cell size and prolifera- 
Чоп. In this experiment, chicken eggs were 
deprived of oxygen through immersion in 
water at incubation temperature, Growth 
Schedules were those listed in Hamilton's 
(1952) Lillie's Development of the Chick. 
On the simple, visually oriented tasks used 
in the study, that group which had under- 
gone deprivation when the visual-motor 
system was developing maximally (about 
the eighth day of incubation) demonstrated 
the greatest differences in behavior. Like 
the findings of Windle and Becker (1943), 
some of the differences disappeared within 
а week or so after hatching. Unlike their 
findings, however, differences appeared at 
this time which were not detected on the 
first behavioral tests. At this time other 
groups (those deprived at 12 and 16 days) 
were different from the controls. As sug- 
gested in this report, there may be a differ- 


ential effect of the early posthatching ex- ' 


perience. 

Typically, increases in complexity of 
structure and function and in efficiency of 
action are related to increases in complexity 
of behavior need satisfaction and/or to in- 
creases in the number of environments com- 
patible to the organism. Growth in any 
environment means a dependence of the or- 
ganism upon certain features (moisture, 
OXygen, pressure, radiation, and tempera- 
ture) necessary for normal development. 
Although these of themselves are not suf- 
ficient, any limitation in the balance of these 
features may halt the developmental process 
and, possibly, life as well. In any environ- 
ment these features are not constant ; to the 
organism, they vary capriciously. Although 
these physicochemical processes of develop- 
ment may not be balanced on a pinnacle, 
certainly they are balanced on a tiny 
plateau. The confines of this plateau are the 
limits of adaptability and, ultimately. sur- 
vivability. 


L NATAL ANOXIA AND LEARNING OF 
Rats AT MATURITY 


Previous studies (Armitage, 1952. Becker 
& Donnell, 1952; Hurder & Sanders. 1953: 
Meier & Bunch, 1950; Scheidler,. 1953; 
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Youngdahl, 1948) of the effects of natal 
anoxia on later learning have produced 
equivocal results. The present study is one 
of several directed at a re-examination of 
this problem. Past studies have generally 
utilized relatively simple problems to test 
learning differences among the groups em- 
ployed. It was believed that (a) use of 
more complex learning tasks, (b) measures 
of retention of such tasks after short time 
periods, and (c) measures of resistance of 
learned responses to extinction might effec- 
tively demonstrate differences unrevealed in 
the past. 

The present study tests the general hy- 
pothesis that deprivation of oxygen imme- 
diately following birth results in reduced 
learning ability for rats at maturity. Two 
levels of deprivation, one-half hour at 
2.97% equivalent oxygen (Group I) and 
one hour at 2.97% equivalent (Group II), 
were used. The 14 unit Stone type water 
maze was believed to be a learning task of 
greater complexity than those previously 
used. Both groups learned and relearned 
this problem. Following the learning and 
relearning of the 14 unit water maze, Group 
I learned a bar pressing response in the 
Skinner box which was later extinguished, 
while Group II learned and reversed a 
Simple position habit. 


Subjects 


One hundred and fifty-six animals from 18 litters 
were used in Group I of the study and 59 animals 
from 9 litters were used in Group II. Of the 156 
in Group I, two died in the dessicator under anoxic 
Conditions, and 43 (23 controls, 20 anoxic) died in 
their home cages before reaching 60 days of age. 
No Significant differences in the mortality rates 
existed between the groups during this time. Of 
the 59 m Group 1I, 10 died in the dessicator under 
anoxic conditions, and 2 (1 control, 1 anoxic) died 
later in the home cages, All animals were from 
the colony of the Department of Psychology, 
Washington University. They were housed in the 
racks of metal cages, three to four animals to the 
cage. The standard colony diet was Purina labo- 
ratory chow and water. 


Apparatus and Procedure 


Deprivation 


The apparatus used to produce anoxia in the 
animals consisted of a vacuum bell jar with a small 


vacuum pump attached. A manometer measuring 
from 0 to 90 cm. of mercury was attached to indi- 
cate the air pressure within the bell jar. The 
amount of air flowing through the system was 
regulated by a stopcock, the adjustment of which 
allowed airflow through the chamber at all times 
so as to prevent stagnation of air within the cham- 
ber itself. 

Through close observation of pregnant females 
near term, newborn litters were obtained for use 
within three hrs. of birth. The criterion for rejec- 
tion of litters was suckling: those litters of animals 
which showed milk in the stomach were rejected. 
Acceptable litters were removed from the maternal 
cage, divided randomly in half, and those animals 
designated as controls replaced. The experimental 
animals were marked by excising some of the 
tissue of the left earbud. 

Following marking, they were immediately sub- 
jected to an environment of reduced oxygen con- 
tent. This was accomplished by placing the animals 
in the bell jar from which the air was gradually 
withdrawn over a 10 min, period. At the end of 
this time a difference in pressure of 66 cm. of mer- 
cury existed between the inside of the chamber and 
the outside air. This environment of less than 3% 
equivalent oxygen was maintained for a period of 
30 min. for animals in Group I of the study and 
60 min. for animals in Group II, At the end of 
the appropriate period the pressure was returned 
to normal; this process also consumed 10 min. 
Fatalities during deprivation were less than 196 for 
Group I and 3196 for Group II. The animals were 
returned to their mothers and reared normally. 
Littermates were later further differentiated by 
notching the right ear. 


Behavior Measures 


A straight pathway tank was used to accustom 
the animals to swimming. This tank was 108 in. 
long, 15 in. deep, and 8.5 in. wide, The depth of 
water in the tank was 9 in. At the end of the tank 
was a wire mesh ladder which led to an enclosed 
platform. % 

Animals were given an initial swimming experi- 
ence between 65 and 72 days of age. This range 
of age differences was necessary in order to run 
each litter as a group. The initial swimming ex- 
perience consisted of five trials per day in the 
straight runway tank for each of five days. 

A 14 unit water T maze of the Stone type was 
used to measure learning. This apparatus has been 
previously described by Licklider and Bunch 
(1946). А wire mesh ladder at the end of the 
maze led to an enclosed compartment in a revolving 
horizontal drum where the animals were housed 
between trials. The temperature of the water 1n 
the maze was approximately 70? F. at all times. 
The water filled the maze to a depth of about 9 in. 

Maze training for all groups was started the 
day following the finish of practice swimming. 
The animals were run 18 massed trials per day in 
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the multiple T maze until a criterion of one error- 
less trial was reached. The motivation used was 
escape from water. Each trial was scored for 
errors. Ап error was defined as entrance of the 
rat's head and shoulders into a blind alley. Time 
between trials varied depending on the number of 
animals in the litter and the degree of learning of 
the animals. Time between trials was never more 
than 63 min. (in the early stages of learning) or 
less than 5 min. 

Thirty days following the attainment of the 
criterion for initial learning, the animals were re- 
quired to relearn the maze. Running conditions 
were as previously described. The criterion for re- 
learning was again one errorless trial. 

Two standard type Skinner box 
Ployed with Group I. To each was 
electric counter which recorded the number of 
ümes the bar was pressed. The boxes were ad- 
justed so that pressure on the bar resulted in the 
sound of the buzzer which was immediately fol- 


were ет- 
uttached an 
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rewarded responses had occurred. Extinction ot 
the bar pressing habit was started the day follow- 
ing the last dav of the reward training. Extinc- 
tion of the habit was accomplished by placing the 
i i zn sd a 20 min. perio 
animals in the Skinner box for : ug Р 
each day until а criterion ої five or less d 
рег 20 min. period was reached. During ee ES 
trials по reward was given Гог bar prose сЕ 
sponses. А record was kept of the go се pui 
period and of the number of periods require 
extinction, . А 
А single unit T maze (previously described i 
Meier & Bunch, 1950) was used to measure habi 
ersal with Group П of the experiment. This 
maze w tank, the stem of which measure E 
in. and the arm 36 in. The maze was 15 in. deep 
8 in. wide in the stem, and 6 in. wide in the NT 
The ends of the arms were extended forward oe 
parallel with the stem, making the ladder by m 
the animals were permitted to escape invisible iu th 
the choice point. Water was maintained at а € «РЕ 


lowed by release of a pellet composed of 50% of 9 in. at a temperature of approximately 7 T 
Purina laboratory chow and 50% sugar. Habit reversal training in the single T mer ant 
. Pretraining for the Skinner box was carried out started for Group П at 120 days of age. All @ 
in а "pseudo" Skinner box. Тһе interior of the mals were required through massed practice 
pseudo” box was identical to the Skinner box learn a simple left turning habit to a criterion. р 
employed except that it did have a stationary food 10 successive errorless trials. Two days. follow Ye: 
tray but did not have the bar, il original learning, the animals were required tO ia 
At 150 days of age pretraining for the Skinner verse the left turning habit to a right are а 
box was started Гог Group I. The animals were habit. Again learning was by massed practice 
placed on a 23 hr. hunger schedule. On each of the criterion of 10 errorless trials. 
1 consecutive days access (о food of the type used 
ог reward in the Skinner box was allowed in the Results 
zu cage for a one hr. period. Water was avail- DON rd 
able at all times, Three days of training were next Table 1 gives the means and standa ў 
given in the “pseudo” Skinner box. During this фае EZ : for the con 
period the animals were allowed to eat for ап hour — deviations for trials and SHOE Ee for 
"Thes тот the food trays of the training box. trol and one-half hour anoxic n the 
ir ; ; A i Р А 
manner 0 САУ ration was obtained in this both original learning and relearning ! ple; 
Following pret 14 unit maze. As indicated in the € 
raining the ani d м tats” 
of ik days training in the A eg en E none of the mean differences was $ 
ot thes М 4 м "n 
€ days the rat was left in the box until 30 tically significant. 
TABLE 1 
COMPAR: — G OF 
n pres OF TRIALS AND ERRORS ron INTIAL LEARNING AND RELEARNIN 
NIT Maze Bv CONTROL AND ONE-Hatr Hour ANOXIC GROUPS 
Control Anos 
noxic 
KEDÎ Cus 
Mean SD Mean SD i NE 
Initial Learnin, в 05 
Hals; © 22.89 14.86 22.30 15.24 .20 ms 
Relearning 5 
oa 6. .0 
Initial Learning 173 A 4.94 7.42 6.99 25 > 05 
Errors: É 125.36 154.98 104.16 .86 me 
Relearning 5 
25.69 25.66 25.13 23.62 42 >. 
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TABLE 2 


COMPARISON OF NUMBERS OF RESPONSES AND Days NECESSARY FOR EXTINCTION OF A Вав PRESSING 
HABIT BY CONTROL AND ONE-Hatr Hour Anoxic Groups 


Control Anoxic 
(№ = 15) (№ = 19) 
Меап SD Mean SD t b 
84.66 34.84 7.21 51.88 .79 2.05 
a 4.20 1.47 4.47 1.93 .45 >.05 


Table 2 compares the control and one- 
half hour anoxic groups for the number of 
responses and days necessary for extinction 
of the bar pressing habit in the Skinner 


box. The lack of significant differences for 


these variables is demonstrated by the small 


| 


7 


t ratios given in the table. 

Table 3 gives the means and standard 
deviations for trials and errors for the con- 
trol and one hour anoxic groups for both 
original learning and relearning in the 14 
unit maze. Examination of the f ratios re- 
ported in this table reveals that controls 
took fewer trials (p < .01) than anoxics on 
both original learning and relearning. Con- 
trols also made fewer errors for both ses- 
sions (р < .05) than did the group with 
one hour oxygen deprivation. 

In Table 4 are found the numbers of 
trials and errors received by the control and 
one hour anoxic groups in learning a simple 
left turning response and then reversing it 
after a two day period. No significant dif- 


ferences existed between the groups for 
trials or errors during position learning. 
Significant differences, favoring the control 
group, did exist for both trials (p < .05) 
and errors (р < .01) when the response 
was reversed. 

A comparison of variances of controls 
and anoxics of Groups I and II for all vari- 
ables measured is presented in Table 5. For 
Group I, significant variance differences oc- 
curred only in the case of relearning trials 
for the Stone maze. All variances differed 
significantly for Group II except the vari- 
ances for errors in position learning. 


Discussion 


The results of this study indicate that de- 
privation of oxygen for a 30 min. period 
following birth does not affect later ability 
of rats to learn and relearn a Stone type 
maze or to retain a bar pressing habit. 
Deprivation of oxygen for a 60 min. period 


TABLE 3 


COMPARISON or NUMBERS оғ TRIALS AND ERRORS FOR INITIAL LEARNING AND RELEARNING OF 
14 UNIT Maze ву CONTROL AND ONE Hour Anoxic GROUPS 


Control . 
(N = 23) 
Mean SD 
Original Learning 20.00 8.96 
Trials: 
Relearning 5.56 3.77 
Original Learning 149.04 63.48 
Errors: 
Relearning 14.83 10.72 


Anoxic 
(N — 24) 
Mean SD t Р 
ПЕНИЕ Pet PE 
30.50 13.50 3.09 «.01 
10.46 7.64 2.74 <.01 
239.11 160.32 2.50 <.05 
40.96 47.17 2.29 <.05 
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TABLE 4 


E. SIN d EVERSAL OF A 
NITIAL LEARNING AND REVER 
COMPARISON ОЕ № T хр Errors FOR INIT: 4 

N NUMBERS OF TRIALS AND н I ies 

i Position HABIT ву CONTROL AND One Hour Anoxic Groups 


Control Anoxic 
(N = 19) (N = 21) 
ae سے‎ Ф 
Меап 5р Меап 5р t 
- >.05 
5.30 72 
Position Learning 12.68 2.18 13.62 5.3 es 
Trials: ۴ 14.52 4.28 2.60 05 
Position Reversal 11.95 1.10 = ee "38 = 
Position Learning 1.47 1.04 ш Me 
m 5 7.95 3.80 4.21 
Position Reversal 3.95 1.82 P 
-—  — 2 CJ M АВА EE Nard Nus Ne — 


following birth does affect the later 
of rats to learn and relearn а Ston ) 
maze and to reverse а position habit previ- 
ously learned. The meaning of the results 
шау best be considered when compared with 
results of other recent studies, 

Windle and В 
and Donnell (1 


ability 
e type 


ers to severe oxygen 
off intrauterine circu- 

lation. Armitage (1952) reported that ad- 
ministration Of varying doses of drugs to 
emale rats on the day prior to parturition 

ada detrimental effect on the later learn- 
ing ability of the young. Scheidler (1953) 
found that the offspring of female tats who 


been exposed to a simulated altitude of 
30,000 ft. for 2 hrs. just prior to birth 
learned less well than the offspring of nor- 
mal mothers, Significant differences be- 
tween anoxics (one-half hour 


lent oxygen immediately fol 


deprivation by cutting 


abits are 
1950). 

hese positive fi 
dings of two stu 
dahl (1948) deprived halves 
oxygen within 6 h 
tion was accomplis 
of the atmospher 
equivalent oxygen 
No differences we 
tween anoxics an 
age. No differenc 


reported by 


ndings are 
dies, Young- 
of litters of 


€ of a chamber to 5% 
in. period. 


es were found by Hurder 


red 

and Sanders (1953) when they К. 
rats which had beeen exposed toa WES age 
altitude of 34,000 ft. for 34 min. в; 3 hr 
of 1 hr., rats exposed to 30,000 ft. 0 
at the age of 1 hr., and normal anima e" 

The authors believe that the am 
effects of anoxia at birth have саа "ther, 
tively established. They believe, m for 
that perhaps two factors are i = IL in 
the varying results of Groups I - p^ 
the present study and the positive = above: 
tive results of the studies describe -of the 
These factors are (a) the ape e che 
experimental animals at birth T4 the ех” 
amount of deprivation received by 
erimental groups. , 
р In the sade (Айн 1952) A w 
guinea pigs were used as subjec are re 
these factors acting in саир The 
sponsible for the positive resul e with 
guinea pig is very mature at birt пох 
а consequent low resistance to ер” 
(Fazekas et al., 1941). Very severe Б 
vations (almost complete denial of £ were 
in some cases oxygen or gas mixtt & 
required to revive the experime * 
mals) were used in these studies. 1i 
ity is attested by the high morta 
(36%) in the experimental gra 
fore, low resistance combined к. posit 
deprivation produce the consisten 
findings reported. р 

It is in die studies of rats that 1D 


€ 


ver 
sever 


con" 


1 


nich | 


< 


регез 


anim E 
sistent findings are reported. These 


Е be 
being very immature at birth can 


е 
istant 
pected to have a high degree of resis 
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to anoxia (Fazekas et al, 1941). Young- 
dahl (1948), Meier and Bunch (1950), and 
{һе experimenters in this study (Group I) 
used about the same degree of deprivation 
(approximately 366 equivalent oxygen. or 
46,000 ft. for 30 min.). Hurder and Sand- 
ers (1952) subjected their animals to 34,000 
ft. for 38 min. or to 30,000 ft. for 120 min. 
Mortalities for the studies reporting them 
are as follows: Meier and Bunch, overall 
mortality of the anoxic group exceed the 
controls by 2.1%; Hurder and Sanders, no 
losses from anoxic processing, two unspeci- 
fied animals died; and Group I of the 
present study, 196 died while being rendered 
anoxic. Youngdahl does not report these 
data. With the exception of the limited 
findings of Meier and Bunch, no differences 
were found between the experimental and 
control groups in these studies. 


Scheidler (1953) subjected pregnant rats 
to a simulated altitude of 30,000 ft. for 2 
hrs. Armitage (1952) subjected pregnant 
rats to varying doses of drugs on the day 
prior to the birth of the young. Mortality 
rates in these studies were: Scheidler, 3996 
of the animals died during the deprivation 
period; and Armitage, 319% overall mor- 
tality in the anoxic litters as compared to 
16% for the controls. Both of these studies 
showed positive results. 

Consideration of the above findings has 
led the authors to make the following for- 
mulation which is believed to be of heuristic 
value: 

In any large population it is likely that 
the ability to resist deprivation is normally 
distributed. Some few individuals will be 
readily susceptible, some few will be highly 
resistant, and the majority which lie be- 


TABLE 5 
COMPARISON OF VARIANCES FOR CONTROLS AND ANOXICS OF GROUPS I AND П FOR ALL VARIABLES 
Control Var. Anoxic Var. F Ratio Р 
Group I 
Stone Maze 
Trials: к. 8 
itial Learni 220.82 232.26 1.05 >.05 
Me S 24.40 48.86 2.00 <.01 
Еггогв: А 
itial L i 15715.13 10849.31 1.45 2.05 
аа 658.44 557.91 1.18 >.05 
4 53 56 
Skinner Вох 
Responses to extinction 1213.83 2691.53 2.22 2.05 
Days to extinction 2.16 3.72 1.72 2.05 
df 14 18 
Group II 
Stone Maze 
Trials: 
Initial Learning 80.28 182.25 2.9 <.05 
Relearning 14.21 58.37 4.11 «.01 
Errors: 
Initial Learning 4029.71 25702.50 6.38 <.01 
Relearning 114.92 2225.01 19.36 «.01 
df 22 23 
Single T Maze 
Trials: Р А 
Position Learning 4.75 28.09 5.28 «.01 
Reversal 1.21 18.32 15.14 <.01 
Errors: 
Position Learning 1.08 1.56 1.44 5.05 
Reversal 3.31 14.44 4.36 «.01 
df 18 20 
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tween these extremes will be grouped 
around some mean resistance value. [t is 
possible to divide the total area of this 
normal curve into many small areas so that 
the percentage of cases falling in each arca 
is known. The anoxic tolerances for indi- 
viduals in each of these small areas can be 
empirically determined by steadily increas- 
ing the amount of deprivation and noting 
mortalities. It is important to consider what 
happens to individuals in the small areas 
adjacent to that area in which fatal depri- 
vation occurred. This is illustrated in 
Figure 1. Individuals to the right of this 
small area have all succumbed earlier be- 
cause their resistance was less. Individuals 
to the left have been affected to varying 
extents. A graph can be made giving the 
proportion of fatalities in the population for 
each successive increment of deprivation. 

his assumes a linear relationship between 
deprivation and proportion of fatalities, al- 
though other predictable 


be substituted, 
Tt has already 


relationships could 


\ been stated (Gomez & 
Pike, 1909; Grenell, 1946) that a gradient 
ОЁ resistance to anoxia ex 

Centers ате more susceptible than lower cen- 
ters. This would imply that a gradient of 
effect for a given level of deprivation oc- 
curs extending in the direction of high re- 


Sts ; higher mental 


А. 


-2 = 
P | о 1 2 3 
Resistance 1o Anoxia "i 
(Sigma Units) 
8 Lethol 


Sub- Lethal 


i о 1 2 3 
Resistance fo Anoxia ні 

(Sigmo Units) 
Fi, 1, Schematic re 
of tolerance to 
of subsequent al 


Presentation of 
Oxygen deficiency and 
terations in behavior, 


distribution 
Probabilities 


sistance. А given deprivation would pro- 
duce а proportion of fatalities. Individuals 
falling in that part of the curve айша 
їһе агеа of fatality would be very ee 
affected, at the point of death, with Fra 
affects diminishing as distance toward high 


resistance decreases. 


The above formulation is important a 
statistical reasons. Let it be assumed n 
the normal curve for resistance to -—«— 
has a mean of zero and a sigma of one. or 
it be assumed that the gradient of effect cA 
deprivation extends for one sigma № — 
the left of the arca of fatality for any Br is 
deprivation. If a sample of ааа 
submitted to oxygen deprivation to a 0 
tent that 5% are fatalities (Figure E 
what is the effect on the rest of the ey Б 
tion? [n this case the distance from iii 
edge of the fatal area to the mean 01 ihid 
distribution is 1.64 sigma units. If nit 
gradient of effect extends one sigma Ч e 
towards the mean, 21.0% of the total e 
ple has suffered some effect less than mee 
Let the case where the deprivation is мт 
that fatalities have amounted to 25% oi 2d. 
sample (ligure 1B) be next consider 
Here, the distance of the fatal area fient 
the mean is .67 sigma units. If the grat eer 
of effect extends for one sigma unit bg as 
the mean, 37.8% of the sample has b 
affected to some degree. 


this would 


In terms of learning scores, nta 


mean that only part of an рш in 
group could be expected to have a chani in 
learning ability. The effect of a derea 
learning ability for part of an ехрепш н 
group would result not only in pes 
mean learning scores, but, also, in ine 
ing variability for the experimental 87012 
as a whole. This is substantiated by ^. 
findings of the present study which are IL 
ported in Table 5. In the case of Gronh, 
where the anoxics suffered consider gy 
mortality, the variance for every leat! r 
variable save one is significantly greate" 
the experimentals. . [init 
It appears, then, that within certain „есі 
and depending upon the gradient of © am" 
the mortality rate in the experimental 5° a 
ple may be used as an index of the t° 


ое” - 
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effect upon that sample. In studies such as 
those of Youngdahl, Hurder and Sanders, 
Meier and Bunch, and Group I of the 
present investigation, where very few or no 
animals died as a result of deprivation, 
minimal statistical differences existed be- 
tween the control and experimental groups 
because the total effect on the experimental 
population was small. There probably were, 
however, individuals in the experimental 
groups who had been affected. The effect of 
the scores of these few individuals on the 
sample statistics was negligible, however. 
To summarize the preceding paragraphs: 
in any species the effects of oxygen depriva- 
tion at birth depend upon the maturity of 
that species at birth and the amount of de- 
privation. More mature species require less 
deprivation to produce a given effect. In a 
sample of any species, the mortality rate of 
the experimental sample can be taken as an 
index of the total amount of affect in that 


sample. 


Summary and Conclusions 


Since the experimental literature ap- 
peared to be equivocal in specifying the role 
of natal anoxia in the etiology, we con- 
ducted this study in which both the severity 
of oxygen deprivation and type of criterion 
task were manipulated. One group of rats 
(№=57) was exposed to à simulated alti- 
tude of 46,000 ft. (2.91% equivalent oxy- 
gen) for 30 min., whereas a second group 
(N=24) was exposed to the same atmos- 
pheric conditions for 60 min. Fifty-four 
littermates served as controls for the first 
group ; 23, for the second. 

At the age of 65-72 days, both groups 
were given pretraining (five trials per day 
for 5 days) on a straight swimway- . | 

Following the pretraining, the animals 
learned (to a criterion of one errorless 
trial) а 14 unit water maze of the Stone 
Pattern, Thirty days after initial mastery; 

et the maze. 
which E 150th day of age, : 
learning only 30 min. depriva! à Shi 
consisted I instrumental response. gts 
food ша. (а) 7 days adaptation, e 
training i Hed Skinner box; (b) dc 

$ in a “pseudo” Skinner box; ап 


he animals 
ion started 
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(c) 3 days training in the Skinner box, per 
se. In the last, the time required for 30 re- 
warded responses was recorded on each of 
the days. 

Extinction of the bar pressing response 
in the Skinner box was started aíter the 
last of the three days for acquisition. This 
continued until the animal made five, or 
less, responses in the daily 20 min. period. 

The animals which had 60 min. depriva- 
tion learned a position habit in a single unit 
T maze when 120 days of age. Two days 
after learning a left turning habit to a crite- 
rion of 10 consecutive errorless responses, 
the Ss were required to reverse and learn 
a right turning habit to the same criterion. 

From the data gathered in this fashion, 
we conclude that : 

1. Thirty minutes of oxygen deprivation 
(2.91% equivalent oxygen) had no signifi- 
cant effect on the learning or retention of a 
multiple unit maze. 

2. Further, this degree of oxygen depri- 
vation had no discernible effect on the ex- 
tinction of a bar pressing habit. 

3. Sixty minutes of oxygen. deprivation 
(2.91% equivalent oxygen) produced sig- 
nificant differences in the learning and re- 
tention of a multiple unit maze. The de- 
prived animals were significantly inferior in 
the number of errors made and in the num- 
ber of trials required to reach criterion on 
both the learning and relearning of the 
maze. 

4. Although this more stringent degree of 
oxygen deprivation produced no demon- 
strable effect on the learning of a simple 
turning response, it did produce a deleteri- 
ous effect on the reversal of that habit. 


А comparison of these results with the re- 
sults of similar studies, some of which re- 
ported positive findings, suggests that cer- 
tain variables are of special importance in 
determining whether positive or negative re- 
sults will be found. The differences 1n 
results among the various studies may bea 
function of the degree of deprivation use 
in the study and the physiological maturity 
of the experimental animals at birth. We 
have suggested that small amounts of de- 


Я А i ü m 
privation, while seriously affecting some 
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few individuals in the population, would 
have little effect upon the statistics derived 
from a sample of that population. Possibly, 
the mortality rate in the sample may be 
validly used as an index of the total number 
of individuals affected in the sample by a 
specific amount of deprivation. 


П. Detayep EFFECTS or NATAL ANOXIA 
ON BEHAVIOR AND 
ELECTROENCEPHALOGRAPHIC ACTIVITY 


Two interests guided the course of the 
research discussed in this section: (a) the 
aspects of behavior affected by natal oxygen 
deficiency and (b) the relative development 
of electroencephalographic activity. 

The aspects of behavior affected by 
natal oxygen deficiency. Experimental 
evidence discussed carlier in this mono- 
graph, as well as that published elsewhere 
(Fazekas et al., 1941) on this problem of 
natal anoxia, has shown that behavioral 
dysfunctions may occur, at least on un- 
selected learning tasks. However, a few ex- 
perimental studies (Becker & Donnell, 
1952; Windle & Becker, 1943) and some 
clinical reports (Despert, 1946; Preston, 
1945; Rosenfeld & Bradley, 1948) indicate 
that selected phases of the total behavior 
may be more severely impeded than others. 
Becker and Donnell (1952) report ап in- 
creased incidence of Stereotyped behavior 
among their experimental Ss. Despert 
(1946) has cited instances of extreme levels 
of anxiety; Preston (1945) and Rosenfeld 
and Bradley (1948) point out the difficulties 
of their clinical Ss in the mastery of tasks 
Which require the use and manipulation of 
Symbols and concepts (e.g., arithmetic) .? 

The S used in this research was the cat 
(a "higher" organism as compared with 
— 


Pa... ап of the clinical studies of this topic have 
sac Ad Positive relationships between neo- 
HE emp and intellectual impairment. In a rare 
Taylor (1955). by Apgar, Girdany, McIntosh, and 
, correlations were 

found between blood oxygen content immediately 


Stanford-Binet 


those used in the other experimentation). 
We felt that stereotypy and symbol forma- 
tion could be adequately demonstrated in 
this S. In line with the clinical evidence 
and that offered by Becker and Donnell, we 
predicted that those cats which had been 
deprived of oxygen at birth would evince a 
lesser variability of behavior and a poorer 
mastery of symbolic tasks than controls. 
We predicted poorer learning abilities on 
those tasks where stereotypy and symbol 
formation would probably not appear. 

development of electroence ph- 


Relative s 
Since the pattern of 


alographic activity. с 
clectrocortical activity immediately follow- 
ing anoxemia is well-known (Libet et al., 
1941), but little is known regarding delayed 
or long-term changes, we initiated а PFO” 
gram of electroencephalographic recordings 
on the Ss used in the behavior analysts. 
These recordings were scheduled to start at 
birth and continue at regular intervals for 
the duration of the experiment. 

We predicted a change in the develop- 
ment of the response frequency: in the 5 
with natal anoxemia, the development of 
the frequency of the cortical activity, 
normally from the slow, irregular waves 9 
the infant to the moderate, regular waves 
of the adult, would be temporally delayed. 
At maturity the frequencies for the two 
groups of Ss were expected to be the same. 


Subjects 


In using cats as Ss, we capitalized upon the 
existing literature relevant to comparative be- 
haviors on stereotypy, simple discrimination, mul- 
tiple mazes, and problems evoking symbol forma- 
tion. Further, by using this 5 we could utilize the 
considerable amount of material on neuroanatomy? 
electrocortical activity, and cortical function. н 

The cats were born of the breeding stock wit um 
the colony at Washington University, Althoug? 
all the young were of known parentage within the 
colony, the strain was far from homogeneous; it 
was that which is commonly referred to as “alley 
cat” variety (a situation which presented some 
obvious difficulties, especially in the statistical anal 
ysis of the results). 

Seventy-three litters (272 kittens) were 
These were born into the colony in the int 
between March 1, 1950 and April 26, 1952. One 
hundred and forty-six of the 272 kittens were ЄХ” 
posed to a deprivation of oxygen at birth. The 


used. 
erim 
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remainder, 126 kittens, served as littermate 
controls. 

Of those exposed to oxygen deprivation, 44 
(30.1%) died, during or immediately after ex 
posure. No single factor other than individual dif- 
ferences in tolerance could be denoted in the ex- 
planation of these fatalities. А certain percentage, 
by no means substantial, seemed to have had in- 
volvements related to nursing prior to the depriva- 
tion, 

Behavioral data from the several tasks were col- 
lected оп 26 of 272 kittens. ОГ these, 14 were 
experimental, i.e, they had been deprived of oxy- 
gen immediately after birth; the remaining 12 were 
the littermate controls. Differences in viability 
between the experimental and control groups did 
not appear once the deprivation and resuscitation 
were complete. Some fatalities appeared within the 
first week after birth (approximately 3% in both 
groups); these are best described as consequential 
to maturation level at birth and difficulties incurred 
during labor and birth processes rather than being 
causually related to the experimentally induced 
difficulties. In those litters where carly postnatal 
deaths appeared, both experimental and control 
groups suffered. The remarkable decrease in the 
sizes of the groups (about 90%) as seen at birth 
and at the time of the completion of the experi- 
ment is sufficient attestation to the existence of 
impediments in the rearing of these animals. (in 
large numbers, at least) through the first year of 


life. 


Apparatus and Procedure 


Deprivation 

exposed to an oxygen de- 
od within the first two 
From work completed 
from preliminary in- 
the severity of the 


The infant cats were e 
ficiency for a single peri 
hours of postnatal life. 
earlier by the authors and 
vestigations with infant cats, ‹ 
oxygen deprivation was set at 2.91% equivalent 
Oxygen (pressure of 416 in. Hg; simulated 
46,000 ft.) for 30 min. An additional 15 min. 
were spent in attaining this condition and another 
15 min, returning to normal atmospheric condi- 


tions. 

The selection of these particul ation: ¢ 
ditions was originally based on certain limitations 
in the equipment available when we were working 
with rats. Contrary to a suspicion engendered 
carly in this study, these conditions did not afford 
a median lethal dose (Т.О) but something less 
(about 112), according to the tabulation at the 
end of the experimentation, (However, continua- 
tion with these deprivation conditions permit com- 
parison of data compiled on rats under these con- 
ditions.) 

In this study, deprivation was achieved through 
the use of a large, cylindrical decompression cham- 
ber (diameter, 54 in.; over-all length, 111 in.; sec 
Figure 2 for details). The decompression was 


ar deprivation con- 


effected by means of a large pump capable of 
creating a "near vacuum" within a matter of a few 
minutes. Normally, however, the chamber was 
evacuated (or filled) at a rate that would be com- 
parable to an ascent (or descent) of 3,000 ít. per 
min. The conditions within the chamber were indi- 
cated on a mercury manometer and a conventional 
aircraft altimeter. The rate of decompression (and 
later, return to normal atmospheric pressures) was 
controlled through a valve sufficient to maintain an 
equilibrium with the vacuum pump at any level of 
decompression, This was indicated on an aircraft 
rate of climb indicator. All instruments were con- 
nected directly to the wall of the chamber at a 
point unaffected by drafts or eddies created by the 
change of air pressure. 

Although the question has been raised repeatedly 
as to whether the decompression per se has some 
detrimental effect on the organism, it was felt that 
the well received work of Paul Bert in 1878 (Van 
Liere, 1942) furnishes conclusive evidence that it 
does not. (At least it has no effect within the 
limits imposed on this experiment.) For this reason 
the authors are convinced that studies in which 
decompression is the mode by which deprivation 
is produced should give results of the same order 
as in those experiments in which the oxygen is re- 
placed by some inert gas, e.g, nitrogen and CO: 
together. The former method has the advantage of 
and readiness of control; the latter, the ad- 
vantage of rapidity of effecting the conditions of 
deprivation. 

After this single exposure to this degree of 
oxygen deprivation, the kittens were returned to 
their respective mothers and permitted to mature 
with their littermate controls. 


Electroencephalography (EEG) 


Shortly before, during, and shortly after the 
period of deprivation electroencephalographic 


m 


n 


Fic. 2. Photograph of decompression chamber 
with two-channel electroencephalograph as used 
for recording during neonatal deprivation. 
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(EEG) recordings were made on the newly born 
kittens. As it was impractical to record continu- 
ously throughout this period (approximately 11 
hrs.) samples were made of 2 min. duration for 
each 10 min. of this entire period. 

Follow-up recordings, of about 3 min. duration 
each, were made the day following the exposure 
(along with the littermate controls and the other 
experimental siblings that may not have had re- 
cordings made at the time of exposure) and at 
subsequent monthly intervals through the fourth 
month. Recordings were again made at the sixth 
month and at the end of the experiment, At this 
time they were at least 10 months old. 

The monopolar recording technique was used. 
Leads were placed on the frontal lobes immediately 
posterior to the orbital ridge and lateral to the 
median line; on the occipital lobes posterior to an 
imaginary line extending from the trailing edge of 
one ear to that of the other, but lateral to the 
median line; and on both ears (the right ear 
served as a reference, the left as a ground) ; six 
electrodes were used in all (Figure 3). Due to 
difficulty in the continued placement of the disc 
electrodes through the movements of the S, 
needle electrodes were employed. (Twenty-seven 
gauge hypodermic needles were cut 4 in. from the 
base of the stem. The lead wires were soldered to 
the flat surface of the base remaining.) EEGs 
were made on a Grass portable two channel re- 
corder (model IID). For the fourth and sixth 
month and final EEG recordings, pentobarbital was 
used to immobilize the animals (dosage: 30 mg./ 
kg. body weight, administered intrapcritoneally). 
Head holders and other restraining devices effec- 


tively stabilized the animals for the earlier record- 
ings.? 


„The EEG recordings served as invaluable in- 
dices of the vitality of the exposed animals. They 
correlated well with the overt behavior at this time. 
When the animals showed a decrease in activity 
(as the lowest oxygen concentrations were 
reached), these records showed decrease in fre- 
quency and amplitude of the wave forms. Through 
the 30 min. at this level, both continued to decrease 
until an extremely low level of electrical activity 
was attained. In some Ss, the depression oblit- 
erated all recordable electrical activity except at 
those infrequent times when the infants gasped for 
air (much less than once each minute). 

The overt behavior of the organisms and the 
concommitant electrical activity evidence the fact 


tion or fatality resulted Pent i oe 

Е " obarbital and other 
barbiturates were not employed at this time because 
of a possible synergistic effect to the anoxemia 
(Himwich, 1951; Snyder, 1949), 


Fic. 3. Representation of head of à kitten to 
illustrate the placement of electrodes for elecroen- 
cephalography with regard to processes on the 
skull. In addition to the four sites indicated (+), 
electrodes were also placed on either ear for refer- 
ence and grounding purposes. 


that the effective period of deprivation extended 
beyond the prescribed 30 min, at the low pressure 
of 66 mm. Hg (2.91% equivalent oxygen). Even 
and regular breathing was not resumed until some 
time after the animals were returned to the litter 
(usually at least an hour). 


Behavioral Tests 


Upon reaching the age of six months, the kittens 
were started on a battery of behavioral tests 
(largely learning) which were, in the order Te 
presentation: (a) Guthrie puzzle box; (b) Sings 
unit T maze adapted for black-white discrimina- 
tion; (c) revision of the Hunter Temporal Maze; 
and (d) feline adaptation of the Wisconsin Gen- 
eral Test Apparatus. The presentation of all the 
learning problems had certain aspects in common: 
The animals were run in blocks of 20 trials each 
day. At this time the cats were 20-22 hrs. hungry: 
The reward in all cases consisted of pieces of 
kidney (about 8 т.). Individual animals were 
trained on the apparatus in small groups in which 


both the experimental and control Ss were 

members. 
The various learning tasks will now be described 
individually : ls 
Guthrie puzzle box. The physical nature of i ps 
piece of apparatus was much as it is describe’ R 
box by Е. К. 


the monograph, Cats im a puzzle БТ nisse 
Guthrie and С. P. Horton тя The een 
mechanism was very similar to that depictec i af 
"D" variation in this monograph. It consiste ter 
а pole 13 in. in length and $ in. т саа 
mounted on а hemisphere of wood 4 in. 1m Qe Я 
eter. Any horizontal movement (at least 4 in. а 
the top of the pole) was sufficient to make elec 
trical contact and activate a solenoid to open tht 
door permitting escape and access to the food (see 


a — 
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Figure 4). The inside dimensions of the retaining 
box (the main part of the puzzle box) were: 30 in. 
wide, 24 in. deep (the floor was an extended half 
circle with a radius of 12 in.), and З in. hi 
Above the gl face of the box was i blackboard 
O} in, high, situated to indicate the the num- 
ber of the trial, and the number of the Y. 

Within the circuit for the escape mechanism, 
contacts were made to operate а сатега (35 mm.) 
which “caught” the cat as it manipulated the pole 
to escape from the apparatus. 

The animal was timed from the moment of entry 
into the retaining box to the time the solenoid was 
activated and the door opened. 

Little preliminary training was 
trials were given to familiarize the animal with the 
goal: a dish of food on the platform in front of 
the escape box. Ten more trials were given to 
goal-orient the animal by permitting him to run 
from the starting compartment, through the retain- 
ing hox, and out the escape door to the food. (The 
pole was not in position.) 

Vive different positions were lable for place- 
ment of the pole: middle, left front, left rear, right 
front, and right rear. The first 50 trials were 
with the pole in the middle position; the next 20 
with the pole in the left rear position; the last 
10 with the pole in the right front position, Thus, 
all Ss had 80 experimental trials. 

The reason for the choice of this particular 
problem stems largely from the article by Windle 
and Becker (1943) in which the authors alluded 
to the stereotypy of the anoxic animals in the 
simple problem used to test their learning ability. 
Guthrie and Horton claim that by analysis of the 
photographs taken at the moment of operation of 
the escape mechanism they were able to infer 
stereotyped behavior to their Уз. (By stereotyped 
behavior Guthrie and Horton meant similarity of 


required. Five 


Fic. 4, Front-side view of Guthrie puzzle box. 
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Fig, 5. Top-side view of T maze discrimination 
apparatus. 


hodily position and contact in the act of manipulat- 
ing the pole.) These two statements were inte- 
grated in the construction and usage of the appa- 
ratus, Both the experimental and the control 5% 
should, according to Guthrie and Horton, show this 
form of stereotyped behavior. However, if Windle 
and Decke метеп is valid, at least for cats, 
the natally asphyxiated group should show this to 
а greater extent than the control group. 

As a further check on this issue, the various 
ions of the pole were uti zed as indicated 
That group which possessed the greater 


pos 
above. 


degree of stereotypy ‘should operate the escape 
mechanism in a new, unfamiliar position less 
readily than formerly. 


Discrimination. problem: Single unit. T mase. 
The apparatus used to test the ability of the cats 
to discriminate. brightness was of conventional 
desigt simple, enclosed, single unit T maze. The 
stem of the T was 40 in. long; half of this length 
comprised the starting box, the other half the 
straightaway to the choice point. A vertically oper- 
ated door divided the starting box from the stem. 
The arms were 24 in. in length. A partition located 
at the junction of the arms with the stem separated 
the arms, one from the other. At this point a brass 
rod was fitted across the stem from which the 
swinging doors (one-way) were suspended (see 
Figure 5). 


The maze was 10 in. wide in the stem and 8 in. 
wide in the arms; it was 10 in, deep throughout, 

Twenty-five preliminary trials sufficed: 5 to in- 
struct the § in running forward from the starting 
hox; 20 to goal-orient him to the left and right 
goal boxes (swinging doors were not in position 
at this time). 
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Subsequent to these 25 trials solid gray doors 
were placed at the choice point, partly to acquaint 
the cat with the operation of the doors, partly to 
teach him left and right position habits (depending 
on his preference, although instructed that reward 
was to be found in both goal boxes). After making 
this choice as to the preferred side (five consecu- 
tive trials to one side), the 5 was trained to go to 
the opposite box in accordance with a noncorrective 
technique. Here the criterion was 10 consecutive 
trials to the opposite goal. 


Upon mastery of these two position habits the 5 
was confronted with the task of learning a bright- 
ness discrimination habit. In this instance the gray 
doors were removed and solid black and white ones 
substituted. The black door we positive for all 
Ss; its position varied randomly from left to right 
according to a prepared list. The criterion of 
mastery was 10 consecutive perfect trials. А maxi- 
mum was arbitrarily set at 300 trials. 


Double alternation problem: Hunter temporal 
maze. The piece of apparatus used in this problem 
was of an original design suggested by EF, H., Pal- 
mer (1950). Palmer conjectured that one of the 
factors that created difficulty in learning Hunter's 
maze was the number of 90° turns that had to be 
made which were not relevant to the cruci 
at the choice point. His results on 
the maze bore out this contention. 

The version of the maze 
ried Palmer's plan to the 
design the outside corne 


а revision of 


used in this study car- 
extreme. In. Palmer's 
Н TS were rounded leaving 
still the corners at the intersection info the stem 
leading to the choice point. The current revision 
consisted of а single T with 10 in. arms and stems. 


Fic. 6. Top-side view of t 
starting box is below small s 
right center of the picture; 
left center. 


emporal maze. The 
egment of are in the 
the goal hox is at the 


Circular segments (10 in. inside radius) joined the 
ends of the arms of the T to the мет. Where the 
two circular segments approached the stem of the 
T, an are of the segments was movable so that it 
could complete the left or the right path (see 
Figure б). The passages were 8 in. wide and 10 in. 
deep throughout. 

The starting compartment was below the move- 
able arc that completed the left and right paths. 
This was maneuvered into position by depressing 
а lever, thus raising the arc and the starting com- 
partment (see Figure 7). 1 

The goal box was an extension of the stem of 
the simple T beyond the choice point. 

One-way swinging doors were placed at the ends 
of the arms of the T and at the start of the stem. 
These the Ss operated. A door was also placed at 
the goal box, concealing it from the animal as he 
entered the choice point. This the experimenter 
operated, Da 

The particular double alternation pablo o 
left-left-right-right (LLRR). Reward follow 
the completion of the entire problem. ее 

At least 90 preliminary trials were requires or 
fore the Ss could attempt to learn the pattern. A 
these, 20 were spent orienting the animals и 
the goal; the animal ran from the starting С 


8 ы 1 i doors 
partment directly to the goal box yei Кес. 
open. The next 10 were the same са was 
the swinging door into the stem о 


closed. 

In trials 31-65 Н 
single choiee at the intersect! 
to the goal box was now 
"correctness" of the response, whethe 


the animal learned to make а 
on of the T. (The doo! 
closed initially.) The 
r left or right, 


Fic. 7. Front view of temporal maze. The start 
ing box is in the lower center of picture. Depres 
sion of treadle raises this box into starting posi- 
tion. 


І 


EIE. - 
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varied randomly according to a prepared list. Dur- 
ing the subsequent 20 trials, and in the 5 following 
them, the У learned to put two and three, respec- 
tively, of these choices together into a single trial. 
Again, the correctness of the responses varied in 
a random fashion, Putting four such choices into 
a single trial became the actual problem that was 
to be learned. 

The criterion chosen for mastery of the problem 
was three consecutive errorless trials! Along with 
the number of trials required to reach this level of 
performance, the errors made on each of the four 
legs of the task were recorded. 

The choice of the double alternation problem is 
attributable to the role of ymbolic behavior that 
has been reported for it, even though this reputa- 
tion has not gone unchallenged (Seward, 1948). It 
was conceived to serve а position in this battery of 
tests analogous to those of symbolic functioning 
used in some of the clinical studies. 

Discrimination problem; General test apparatus. 
The apparatus used here was an adaptation of the 
general test apparatus designed by H. F. Harlow 
for his studies of perceptual learning in monkeys 
(Harlow, 1951). The over-all base dimensions 
were 20 by 40 п. Of this, the Tull width (20 in.) 
and 182 in. of the length comprised the restraining 
box from which the cat manipulated the discrimi- 
nanda, The height of this box was 16 in. А ver- 
tically operating door separated the box from the 
remainder of the apparatus, At this point, inside 
the vertically operating door, two pieces of plate 
glass were so arranged as to form a horizontal 
slit (1% in, high) at the level of the tray (3 in.) 
upon which the discriminanda were placed, This 
tray extended the full width of the apparatus, 
being of itself 10 in. wide. Two 2 in. depressions 
(6 in. between centers) in this tray served as 
receptacles for the food reward. 

A second vertically operating door separated the 
tray from the experimenter, CONCE ing him from 
the S. 

A fluorescent light mounted on the door in front 
of the 5 illuminated the discriminanda and the tray 
(see Figure 8). А 

The cat was given some 20 trials to practice us- 
ing the paws to secure the food from the tray. In 
an additional 10 trials he was instructed in the 
movement of the discriminanda and subsequent ex- 
posure of the food. 

"hereafter the animal was confronted with the 
problem of selecting the black object (32 im. 
squares were used) over a white object to procure 
the reward, A corrective type training procedure 
was used; the 5 persisted until a correct choice 
was made; this constituted a single trial. As with 
the previous discrimination task (single unit T 


+ The criterion chosen for mastery of the prob- 
lem was three consecutive errorless trials. If inter- 
action is absent, the probability that this could 
а s absent, 


occur simply by chance is 1:4096. 


Fic. & Front side view of adaptation of Wis- 
consin General Test Apparatus. 


maze), 10 consecutive errorless trials constituted 
the criterion of mastery. 

The use of a second discrimination problem was 
occasioned by several observations of the single 
unit T mi Watching the experimental animals 
in their fixated behavior in this maze, the experi- 
menters felt that part of this behavior could be 
explained by a differential motivating factor. Pos- 
sibly the experimental animals were not as highly 
motivated for food as were their littermate con- 
trols. On every trial in the maze the animals were 
rewarded at least to the extent of being picked up 
and handled by the experimenter. It was conceiv- 
able that this was sufficient to drive the animal 
forward in the maze, but being the same for both 
the "right" and "wrong" choices did not reinforce 
the animal in the learning of the black-white 
discrimination. 

Such an explanation could also be utilized in 
those instances where the animal did not learn the 
discrimination but still did not fixate (this included 
both experimental and control Ss). 


e 


Results 
Statistical Analysis 


The significance of differences between 
the two groups of animals in their perform- 
ances on the behavioral tests was deter- 
mined by nonparametric procedures. To 
compare central tendencies, the Mann- 
Whitney U Test ( Moses, 1952) was used ; 
for contingencies, Fisher's (1946) exact 
treatment of 2 by 2 tables was used. 
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The preference for the nonparametric 
techniques оуег the more traditional para- 
metric forms lies in certain restrictions of 
the data. Time scores on the Guthrie puzzle 
box were markedly skewed. Error and trial 
scores on the single unit T maze and the 
temporal maze were such that homogeneity 
of variance was lacking. Admittedly, cor- 
rections might have been made for these 
deficiencies by way of such procedures as 
suggested by Cochran and Cox (Edwards, 
1950; Johnson, 1949) and McNemar 
(1949). In any event, even with the loss of 
efficiency inherent in aspects of this analysis 
of results, the statistics used seemed to us 
to be the best choice. 

As stipulated by the nature of the hypoth- 
eses examined here, one-tailed tests of prob- 
ability prevailed (Jones, 1952), 


Subjects 


In the final outcome, the number of Ss 
(N) varied from test to test within the be- 
havioral analysis. Three Ss died after com- 
pleting part of the test battery. Some Ss 
were withdrawn from individual tests be- 
Cause of a failure to respond. (See the final 
paragraphs in this section.) Consequently, 
N varied between 21 and 24. 


Behavioral Results 


Guthrie puzzle box. 
niques were employed with this apparatus. 
The first utilized the latencies of response 
for cach trial. Comparisons were made be- 


Two scoring tech- 


tween the two groups as to the mean latency 
for successive blocks of 10 trials. The trials 
on three tasks were analyzed: (a) the re- 
lease mechanism in the middle of the re- 
straining cage; (5) in the left rear position ; 
and (c) in the right front position. In the 
second technique, the repetitiveness of re- 
sponses for each animal was determined. 
The tabulation was made from the photo- 
graphs taken at the moment of activation of 
the release mechanism. Only the first 50 
trials were considered. (The feeling was 
maintained that the subsequent 30 trials 
would show little in the way of stereotypy 
of response: the rationale for incorporating 
this instrument in the battery of tests. Re- 
views of the photographs sustained this 
belief.) 

Significant differences were reve 
the later blocks of trials on the first task at 
in the first 50 trials, collectively. The per- 
formances on the last task (Trials 71-80) 
also demonstrated statistically significant 
differences. In every case of significant dif- 
ference, the latencies for the controls were 
shorter than for the experimental animals. 
( Only in one of the nine comparisons made 
was the median latency for the experimental 
group shorter than the median for the con- 
trol group.) | 

The study of the photographic nd 
proved unrewarding. In this analysis each 
of the photographs (taken at the time 0 
activation of the release mechanism) was 
considered and rated on several dimensions 
of response. To compensate for any dis- 


aled in 
ad 


TABLE 6 
GUTHRIE PUZZLE Box 
Ti = 
ime Scores === 
Trials: Е " 
rials 1-50 1-10 11-20 21-30 31-40 41-50 51-60 61-70 pem а 
с : * т » catt 
тор С* | Х** | CH | х** | сж | хж | сж х** | се | xer | сж | x** | сж | хак | сж | x** | С* BS 
Median 7.4 | 10.3 | 15.7 | 23.6| 8 20.5 
k . . -8| 7.9|5.1| 9.3|2.9| 7.3] 2.2 4.1| 7.8| 5.4| 8.316.3| 7 
Меп 312 | $6.0 | 35-0 | 116.5 | 9.6 | 36.5 | 6.0 | 15.6 | 6.5 | 19:4 | 5.0 | 11,8 | 7.4 | 13:8 | 27.2 | 15.4 | 6.3| 28-б 
+81 52.8 | 141.3 | 4.3 | 82.8 | 3.0 | 15.5 | 6.3 | 26.2 | 4.7 |5.6 | 15.8 | 3-7 | 15.9 [1.2 [31% 
pW) <.05 «.05 <.0 5 5 <.05 
p(Run) б HE 5205 <.025 
*N = 24. 


## N, the number of Ss used, varies with each 
Note.—Only those probabilities approaching 


problem. This variance was due to the withdrawals necessitated in each instance 
the 19-59% levels of Significance are indicated in this table. 
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TABLE 7 


GUTHRIE PuzzLE Box 
(Percent of responses) 


Response/Total Part/Total Side/Total 2 X /Total 

Group: C x € х С х [c x 
i .29 ‚32 .49 .49 .64 .63 .28 .35 
м "33 136 E 132 ‘64 167 29 E 
SD .15 .18 12 .19 elt .20 412 .16 


Note.— Probability (U)—all greater than .05; № = 24. 


crepancies in the number of trials available 
to this analysis, these tabulations were con- 
verted to percentages and the following in- 
dices were derived: the proportionate fre- 
quency of the dominant localized response 
(the side and part of the body contacting 
the pole), the frequency of a dominant re- 
sponse not considering the side of the body 
involved, the frequency of several responses 
involving one side only, and the frequency 
of immediate duplications of a localized re- 
sponse. No significant differences were 
revealed. 


e———e Control 
х--- -X Experimental 


Median Time 


1-10 11-20 2130 31-40 41-50 


TRIALS 
(in blocks of 10) 


Comparison of performance, in time 


А о groups over the first 50 trials on 


scores, of the tw 
the Guthrie puzzle box. 


Single unit T maze. For the three tasks 
on this apparatus the scoring technique was 
the same: the number of trials to reach 
criterion. For the first position habit, the 
criterion was 5 consecutive responses to one 
side; for the second position habit and the 
discrimination task, the criterion was 10 
consecutive errorless performances. 

Analysis of the performances (in terms 
of trials to criterion) on the two position 
tasks by the Mann-Whitney U test failed to 
show significant differences between the two 
groups. 

As there was an upper limit to the num- 
ber of trials given to each S, enforcing 
an artificial “cut-off” on the distribution of 
scores, and as there was elicited perform- 
ance not conducive to learning the dis- 


criminatory response, the chi square 
TABLE 8 
SINGLE Unit T Maze 
(Trials) 
Task Task 


ist Position 2nd Position 


Group: C х С 2 
Median 20.5 | 30.0 | 28.0 | 28:0 
Mean 20.8 | 40.8 | 26.2 | 31.1 
SD 6.7 | 28.3 | 10.3 | 11.5 
Probability (U) 2.05 2.05 


Note.—N = 23. 
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TABLE 9 


FREQUENCIES OF SUCCESSFUL PERFORMANCE ON 
BLACK-WHITE DISCRIMINATION PROBLEM 


MN 


Not | 
Learn | Learn | Total | р 
Control 8 2 10 | 
Experimental 4 8 їх | 
Total 2 0 


1 | 1 22 pee 


technique was used with the data on the 
discrimination problem. Two separate anal- 
yses were made: (a) the frequencies of 
those who learned and those who did not 
learn were compared for the two groups, 
and the results are given in Table 9: (b) 
the frequencies of those who fixated (as de- 


fined earlier) and those who did not are 
given in Table 10. 


TABLE 10 


FREQUENCIES ОЕ FIXATIONS OF BLACK-WHITE 
DISCRIMINATION PROBLEM 


As indicated by the probabilities given to 
the right of the paradigms, above, both 
analyses demonstrated significant differ- 
ences beyond the .05 level. The frequencies 
of Ss who did not learn the discrimination 
and/or fixated were greater for the experi- 
mental group than for the control group. 

Double alternation problem: Temporal 
maze. The number of trials necessary to 
reach the criterion of three consecutive 
errorless performances and the number of 
errors made in so doing were the scores 
used for this problem. The latter, the num- 
ber of errors made, was recorded in terms 
of the errors made on cach of the four 
"legs" on a single trial. 

When the number of trials were com- 
pared, certain significant differences T: 
peared. In terms of the total number О 
trials to reach the criterion of three con- 
secutive errorless trials, the differences Were 
significant between the .05 and .06 levels, 11 
favor of the control group. The analysis 
was made with the Mann-Whitney U Test. 
This statistic did not demonstrate a signifi- 
cant difference in the comparison of the 


numbers of trials required to reach ris 
е” 


| Non- errorless performance. The difference 
Fixate | fixate | Total b tween the groups on the total number т 
Control 0 10 i trials seemed to be largely attributable to Ей 
арена! 5 1 12 differences in the behavior following hes 
ota 5 17 22 .030 completion of the first errorless рен 
ance. Statistical significance was indicate 
TABLE 11 
TEMPORAL Maze 
(Trials) 
Score 
Total Trials Trials: 0-1 Trials: 1-3 Trials: Perfect 
Group: с х с х c x & X 
Median 182 
Mean ыр 2205 | 970 | 65.5 82.0 | 159.0 | 15.0 820 
SD атл | 119 | 399 | 647 | 93.6 | 167.8 18.7 | He 
ý . 32.7 46.0 47.9 85.3 6.1 11. 
Probabilit: 
di 06 >.05 <.05 >.05 
Note.—N = 19. 
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TABLE 12 
TEMPORAL MAZE 
(Errors) 
Score 
Total Errors Errors 1 Errors 2 Errors 3 Errors 4 
Group: C X C x € х C 3 G S5 
i 9:5 69.0 67.5 205.0 227.5 3.0 9.5 4.0 4.5 
um re is 66.3 64.0 204.7 251.8 6.0 9.3 5.1 5.8 
SD. 97.6 88.0 | 30.0 | 24.4 | 68.9 86.0 | 50 | 3.5 | 4.6 | 5.6 


Probability—all were greater than .05; N — 19. 


beyond the .05 level in the analysis of this 
difference. 

No significant differences in performance, 
as measured in errors made, was found. 
Comparisons were made upon the total 
number of errors made in reaching criterion 
as well as upon the errors made on each of 
the four legs of the task. 


300 


o— Control 


х----х Experimental 


Median Errors 


100 


and 
Leg 


3rd 
Leg 


4th 
Leg 


15} 
Leg 


Segment of Maze 
Fic. 10. Comparison of performances, in error 


scores, of the two groups on each of the four seg- 
ments (“legs”) of the double alternation task. 


Discrimination problem: General test ap- 
paratus. Although used to corroborate the 
other test for brightness discrimination 
(above), the general test apparatus denoted 
no significant differences between the two 
groups. There were discrepancies between 
the medians, in favor of the experimental 
group, but these lacked statistical confirma- 
tion (Mann-Whitney U Test). 

The results on this apparatus bore low 
correlation with the similar data from the 
single unit T maze? АП Ss learned the 
problem; this included even those who fix- 
ated on the earlier task. Some of the most 
rapid learners were those who failed to 
learn the correct solution of the discrimi- 
nation. 

In addition to these data compiled from 
the behavior tests, other impressions of 
group peculiarities were formed. It was 
noted, for example, that the experimental 
animals were more irritable in the learning 
situations. Should some untoward event 
occur, which it inevitably did for both 
groups, it was more likely that these animals 
would “balk.” They would refuse to con- 


s The coefficient of correlation (Kendall's tau) 
for biserial rank order (1948) with these two sets 
of data was +.24. On a test of the null hypothesis 
for the reliability of this correlation, р = 23. 

5 These untoward events consisted of unexpected 
noise within or without the apparatus, unexpected 
resistance of some movable part, e.g., doors in the 


temporal maze, or some unprescribed physical re- 
straint or injury. 


$- -267 
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TABLE 13 
MULTIPLE CHOICE APPARATUS 

(Trials) 

Control Experimental 
Median 47.50 44.0 
Меап 57.80 45 4 
SD 27.9 15.4 
Probability 2.05 


Note.—N = 24. 


tinue operation of the problem impelling 
added encouragement or preliminary type 
training (even though they were in the 
midst of the actual learning trials). 


Further, the impression prevailed that the 
experimental animals were less active and 
responsive in their home cages. The ani- 
mals tended to remain immobile in a corner 
of the cage seemingly unmindful of the 
activity of the other animals (their litter- 
mates) or of the proximity of the experi- 
menter and others concerned with the colony. 


It was apparent, however, that the single 
period of oxygen deprivation had little or 
no effect оп normal sexual functioning. 
Whether it had some effect on the matura- 
tion of these functions could not be inferred 
from the observations. Certainly it did not 


alter the existence of breeding and maternal 
behavior. 


Electroencephalographic Results 


The electroencephalographic data were 
uniformly negative, i.e., no consistent dif- 
ferences could be found between the two 
groups at amy stage of development. Be- 
cause of the wide variability of the elect 
activity between recordings and with 
single recording for a given S, a stati 
analysis of the records on immature Ss was 
not feasible. Within the accuracy of the 
measures, the data on the adults were all 
but identical, and further statistical analysis 


rical 
in a 
stical 


* In those instances where this additional accli- 
mation and pretraining were ineffective, the ani- 
mals were withdrawn from the particular problem. 
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was judged unnecessary. In a a por 
through a random series of records 200 
both groups, an accurate selection of ws 
trols" and "experimentals" could nor 3 
made. (For further data on the META: 
cephalographic recordings, see Appendix A. 


Discussion 


These data indicate that the дашты с 
stereotyped behavior described by тшш 
and Horton (1946) and that describe d 
Windle, Becker, and Donnell (1943, 19 | 
are not necessarily related, certainly = 
identical. According to Guthrie and pee 
if one group of Ss were more me 
typed than the other, then the Ju n 
show a greater proportion of e of the 
havioral patterns in the um, in this 
escape mechanism. It is true thé “epetitive 
study both groups did show the Ars ай 
type of behavior with which ша! wn 
Horton concerned themselves; typ like 
both groups «demonstrated this in 
amounts.? E 
a follows also, although not пеген! 
so, that the stereotyped group wouk kem 
a shorter, or at the very worst, wide 1 
latency of response. The data are Lame. 
contrary. Generally, the experimenta A 
which was hypothetically the more S inf 
typed was the more sluggish in v Win- 

The type of stereotypy discussed P the 
dle and his coworkers was illustrate a 
single unit T maze. A given aser k 
the Ss failed to learn the discriminatto | 
selected one arm of the maze and pu a 
persistently to it. Of course, this wé i 
adaptive response, but one demanding only 
effort and attaining less reward. As jr 
experimental animals demonstrated this im 
of adaptation, the first hypothesis is £ 
substance. 


5 Tt is entirely possible that, had there exi 
satisfactory means of quantifying коа 
would depict variations in the voltage input, 
conclusion other than "no differences" may ap 
been made of the relative electroencephalogt? 
activity of these two groups. eal 28 

9 The series of duplicated responses were E 
long, for any S, as Guthrie and Horton des 
with their Ss. 


= 
——— 
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Two variables, one related to each of 
these problems (puzzle box and T maze), 
may bear consideration for the understand- 
ing of the differences in these two groups 
of animals. In the Guthrie puzzle box, the 
ambiguity of the response-reward relation- 
ship is apparent. The control group learned 
the problem much more rapidly (cf. laten- 
cies on individual trials) than did the ex- 
perimental. It is possible that similarity of 
successive responses 15 unimportant to the 
solution. In the single unit T maze, the de- 
lay in reward and the lack of secondary 
cues are apparent. Again, the control group 
was superior. Should these two variables 
be meaningful in the interpretation of the 
differences between these animals, stereo- 
typy relates to diverse aspects of behavior 
(or, at least, has two diverse operational 
definitions). 

The performance of the animals on the 
double alternation problem verifies е 
notion that the natally anoxic Ss are in- 
ferior in their ability to learn a complicated 
task involving symbol functioning. Al- 
though the groups were not significantly 
different in the rapidity of solution (i.e., in 
r of trials required to make an 
nce), nor in the pattern 
blem (i.e., which leg was 
learned first, second, etc.), they did differ 
in the ability to retain that solution and 
demonstrate that acquisition by consecutive 
errorless performances. The appearance of 
the first errorless trial may pes 
chance phenomenon (р = i ii eae 1 ‘a 
probability of three consecutive randi о 
(p = 1/4096 = 0002) makes untenable = 
explanation of these data on the basis о 


chance alone. | 

1f this double alternation problem is ade- 
quate for an operational definition of sym- 
bolic behavior, then the data corroborate the 
observations of Preston (1945) and Rosen- 
feld and Bradley (1948) : Individuals with 
a history of anoxia during infancy demon- 
strate a deficit in the ability to acquire and 
sustain the usage of symbols in their adap- 
tive behavior. 

Statistically, the results on the general 
test apparatus are such that the null hypoth- 


the numbe 
errorless performa 
of learning the pro 


esis cannot be rejected. The differences be- 
tween the two groups of Ss were not so 
large (nor in the proper direction as speci- 
fied by a one-tailed test of significance) as 
to be significant at the usual levels. Never- 
theless, a sizable difference (medians, 8%; 
means, 2196) did exist suggesting that the 
experimental animals learned the discrimi- 
nation more readily than did their controls. 
Whether further experimentation will con- 
firm these differences or the null hypothesis 
will have little import on the over-all con- 
clusion from these data on the discrimina- 
tion and stereotypy (as defined in the single 
unit T maze) : The experimental Ss adapted 
in a manner not wholly alike that of their 
controls. The failure to learn a brightness 
discrimination on the single unit T maze was 
not effected by an inability to detect differ- 
ences in brightness discriminanda. 

In their entirety the electroencephalogra- 
phic data were disappointing. This conclu- 
sion stems not so much from their failure 
to confirm or reject one of the hypotheses 
as from the difficulty of instrumentation 
and analysis which they engendered. Un- 
usual frequencies appeared in both groups 
to an equal extent. These were ostensibly 
consequential to difficulties in the placement 
of the electrodes (identity of position, in 
particular), in technical control (electrode 
resistance, movement of the skin, or pres- 
ence of underlying muscle groups and si- 
nuses), and in the inability to maintain 
a quiescent S without extensive reference 
to anesthetics or other drugs. For the 
younger Ss, the frequencies were widely 
variant within a single recording. When 
repeated recordings were made, this vari- 
ance was increased. All in all the immature 
EEGs were remarkably unstable (as judged 
by the type of analysis used here). 

With adult recordings, the frequencies 
were very much more regular. Frequencies 
and amplitudes found at one point in the 
recording were usually found throughout. 
Repeated recordings on a given S were ex- 
tremely similar. Only at this time did the 
data appear reliable. At this time no dif- 
ferences between the groups were found— 
an expected result. 
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In general, the offhand, unquantified ob- 
servations on the relative behaviors support 
a hypothesis which explains the origin of 
some, maybe all, of the tested behavior dys- 
functions in terms of activity levels. This 
hypothesis states that for the development 
of normal, adaptive behavior (including 
functioning in contrivances of the psychol- 
ogist’s sort) a level of responsiveness must 
be maintained. Especially must this level of 
responsiveness be maintained during the 
early stages of development. Physiological 
insults, as oxygen deprivation, alter this 
responsiveness, but only temporarily. Cell 
changes may follow the insult ; the cells may 
degenerate and die but are, shortly, removed 
and replaced (e.g., cortical neurones may be 
destroyed by the oxygen deprivation to be 
replaced, later, by glial cells; the usual fail- 
ure to find specific, localized, permanent, 
cerebral histopathology gives empirical sup- 
port to this hypothesis). Nevertheless, this 
transient effect of the insult may have far- 
reaching influences should the period of un- 
responsiveness be sufficiently long and occur 
at a critical point in the organism's growth. 
( Certainly, neither the length of the period 
nor the point of its occurrence, the critical 
point, can be specified at this time.) 

Surveying the whole of the experimental 
design and the results gained, we feel that 
the material, as well as the problems re- 
viewed by Royce (1950), have considerable 
methodological significance 
tion of this nature. Using a well-formu- 
lated, standardized battery of tests with 
known intercorrelations denoting concep- 
tualized factors (with operational signifi- 
cance), the experimenter could greatly fa- 
cilitate the analysis of certain probl 
physiological and/or developmental 
ter (e.g., natal anoxia, prenatal 
irradiation, hypoglycemia, 
ес.). Experimental insults, 
upon the 55 in this study, 
general or a multitude of 
on the organism. To hav 
tery designate these effects and clarify their 
causal relationships with such variables as 
age of the S and length, frequency, and 
severity of the insult would alter the iso- 


in experimenta- 


ems of a 
1 charac- 
and natal 
hypothermia, 
as imposed 
may have a few 
particular effects 
€ such a test bat- 
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lated existence that such problems as these 
maintain in the field of physiological psy- 
chology, im toto. It is plausible that this 
methodology would extricate from these 
data such relevancies as would tend to insti- 
gate meaningful theorization across the 
demarcations of specific experimental prob- 
lems and specific species of animals. 


Summary and Conclusions 


Behavioral measures were made on 26 
cats to evaluate hypotheses concerning the 
effects of natal anoxemia on learning, 
stereotypy, symbol formation, and an 
development of patterns of а рее 
ographic activity. Data of both ане вея 
experimental origin were presented | offects 
cate the definite possibility of market x. ot 
of anoxemia at birth upon these forms 
behavior. : 

Of these Ss, 14 were exposed UD Has 
deprivation immediately after „ж 
means of a decompression chamber ї [д 
isting atmospheric conditions were ret er 
to 2.91% equivalent oxygen under W ais 
the animals were maintained for 30 d at 
EEGs were collected at this time pas a 
monthly intervals thereafter. With ue d 
ception of the initial recording, cone 
recordings were made on the remaining 
Ss, the littermate controls. 

At the age of six months the behav a 
the animals was observed on a Е o 
tests : Guthrie puzzle box, single unit Tn "€ 
temporal T maze, and a general test apie 
ratus. The cats were to learn a wn 
brightness discrimination habit on the "m 
ond and fourth tests and a double alte" 
tion habit on the third. 

These conclusions were drawn fro! 
observations : 


ygen 
Ву 


ior of 


ү the 


1. Тре degree of oxygen deprivation - 
curred in this study altered the Ss’ bahay е 
in such a way as to significantly ae 
the rapidity of response in the puzzle 0^. 

2. These animals demonstrated an M 
creased propensity towards poorly adap at 
responses. They tended to "stereotypt han 
certain discrimination tasks; rather еу 
learn the proffered discrimination. t" 
would learn a position response. 


ANOXIA, BEHAVIORAL DEVELOPMENT, AND LEARNING ABILITY 25 


3. These animals did not manifest differ- 
ences in ability to learn brightness discrimi- 
nations in situations in which there was a 
close spatial and temporal proximity of re- 
sponse to discriminanda and reward. 

4. They evinced differences in ability to 
cope with symbol formation, specifically, in 
their ability to sustain а symbolic solution 
after once the solution was gained. 

5. They did not show differences in elec- 
trical activity (ЕЕС) as seen at adulthood. 
( Difficulties in instrumentation and analysis 
prevent conclusion on the development of 
these patterns.) 


Tue Errecrs or PRENATAL NOXIA 
on LEARNING IN Rats 


IL 


Although the cellular action of an oxygen 
deprivation prior to birth may be the same 
as similar deprivation later (Schreiber & 
Gates, 1938), the differences in the stages 
of development may be a significant factor 
circumscribing alternations in the organ- 
ism's ultimate level of functioning. There 
is considerable evidence ( Breslin, 1956; 
Ingalls, 1947a, 1947) that insults at speci- 
fied periods in the fetal life will effect 
changes in tissues which are developing 
maximally at that time. Some evidence 
(Meier & Menzel, 1955) has appeared in- 
dicating that such insults may lead to spe- 
cific alternations in behavior as well. 

In order to appreciate the significance of 
physiological insults upon the prenatal T 
ganism, such as oxygen deprivation, the 
and scope of the developmental 
activities in this period must be emphasized. 
This is the time in which occurs most of the 
growth and development of the organism— 
more than the sum total of such activities 
in the remaining vastly more lengthy span 
of its life. Most of the characteristics for 
physical and behavioral development are 
established during this short interval. 

The study to be described here was de- 
signed to investigate the effects of prenatal 
oxygen deprivation and, further, to relate 
the nature of the effect to the age of the 
prenate at the time of the deprivation. Spe- 
cifically, the abilities of two groups of ani- 
mals to learn and retain a simple and a 


significance 


complicated maze pattern were measured. 
The first group, the experimental, was born 
to mothers which had been exposed to an 
oxygen deficiency via a decompression 
chamber. The second, the controls, was 
born to mothers of the same stock as the 
first but which were maintained under 
normal colony conditions for the entirety of 
the gestation period. 


Part I 
Subjects 


A population of 80 female albino rats (Wistar 
strain) from the Buckeye Laboratories in Ohio 
was the source of the 55 observed in this study. 
Randomly, half were designated for the control 
and half for the experimental groups. АП were 
housed for an appropriate interval with a small 
group of males of the stock (Wistar strain) in 
the colony in the Psychology Department at Wash- 
ington University. The housing of the males with 
the females was done in such a way that the sires 
of the experimental offspring were also the sires 
of the control offspring. 


Apparatus and Procedure 


Deprivation. Following a positive test for preg- 
nancy (by palpitation), the 40 experimental fe- 
males were placed in a large decompression cham- 
ber and exposed to a simulated altitude of 30,000 
ft. (equivalent percentage of oxygen, 6.21; sea 
level, 20.93) for 120 min. (The pressure in in. 
of mercury was 8.88 as compared to 29.92 in. at 
sea level) The rate of ascent and descent was 
3,000 ít. per min. The temperature was main- 
tained at approximately 80° F, at all times during 
exposure. 

Of the 40 animals submitted to the exposure, 12, 
or 3096, died in the chamber. Based upon observa- 
tion through the window of chamber, it was 
thought that all of these animals died within the 
first 30 min. of exposure. One animal aborted and 
devoured her young during the first 10 min. of 
exposure. 

Post-mortem examinations revealed that 2 of the 
12 animals which died were not pregnant. The 
post-mortems revealed, further, that the ages of 
the fetuses at death ranged from an estimated 3 
days to an estimated 8 days, 

The living animals, following a short recovery 
period upon removal from the chamber, showed 
no overt physical disturbances, Moreover, this was 
true of the 2 deprived animals which died within 
5 days and the 3 others which died with 10 days 
of exposure. Thus a total of 39% of the pregnant 
females died during deprivation or within 10 days 
following. 


In the control group, 3 of the 40 animals died. 
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The total number of animals born to anoxic 
mothers was 180. Of these, 32 died or were de- 
stroyed shortly after birth by the mothers. The 
first 130 animals born were used in the behavior 
measures and the remaining discarded. To the con- 
trol mothers, 172 young were born, and of these 
35 died or were destroyed by the mothers, The 
first 130 born were used as the control group in 
the behavior measures. 

The 130 young of the anoxic mothers were born 
on an average of 7 days following exposure, with 
the extremes being 2 and 11 days. Since the action 
of the oxygen deprivation upon the duration oí 
gestation is not clear, no inferences can be drawn 
as to the exact fetal age at the time of exposure. 

Behavior measures. The test situation utilized in 
measuring the animals’ behavior included a straight 
water tank (a straight "swimway"), a fourteen 
unit T water maze, and a single unit T water maze. 

The straight swimway served a dual purpose: to 
pretrain the animals for the mazes and to discern 
Swimming speeds of both the anoxic and control 
groups. The tank used was 108 in. long, 8.5 in. 
wide, and 15 in, deep. The depth of the water in 
the tank was 9 in. 

The animals were placed in a 
over the water at the starting enc 
then dropped 3.5 in. to the water through the 
hinged bottom of the box. The animals then swam 
to the ladder and goal platform (and warmth) at 
the Opposite end of the tank. Each rat had 10 
trials in this task on the 59th day after birth, 

On their 60th day of age, the animals of both 
the control and the experimental groups were 
weighed. One hundred of each group were started 
on the multiple T maze. 

The maze used was a 14 un 
Stone pattern) filled 
In the Scoring proc 
Were recognized an 


1 box suspended 
1 of the tank and 


it T maze (modified 
with water to a depth of 9 in. 
edures, three types of errors 
е d recorded, along with time and 
trials to criterion, "The three types of errors scored 
were as follows: retracing in a section of the cor- 
Tect pathway, entering a wrong Pathway, and enter- 
ing either end of a cul-de-sac. Duplicated errors 
were individually recorded. Five gates were located 
at points as specified in an earlier report from 
this laboratory (Licklider & Bunch, 1946), 
‚ Each animal was given three trials a day until 
it reached a criterion of one errorless trial. Any 
animal spending more than 30 min. in the water 
in any one trial was removed from the situation 
and discarded from the experiment. On this basis, 
‚ anoxic animals and one control animal were 
discarded, 
ginal learning of the 
been сева these animals re- 
ze to the iteri 
Fi di mum Same criterion (one 
In the 30-day interval between the learning and 
relearning of the multiple unit maze, half (50) of 
the animals from each group of 100 learned а 
brightness discrimination problem in a single unit T 
water maze. The animals were run on the single 
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unit maze ll days after they had reached the 
criterion on the 14 unit maze. " 
The length of the stem of the T maze was 42 x ; 
the length of each arm of the T was 18 in. т 
ends of the arms of the T were extended ике 
6 in. parallel with the stem, making it Mice 
see the escape ladder from the choice point. т 
water was maintained at a depth of 9 ш, iod 
temperature of the water was 5-6 degrees Е. be 
the temperature of the room. Е 
At the choice point of the T, а opu 
placed separating the arms. А rod passec ^ d 
this septum and fastened to the sides of Ше ae 
was used to hang the black and white ка pu 
positions of Ше gates were varied, randori у. Мет. 
escape ladder was placed on the side of the ud 
rect” gate. The gates were free swinging, ДЕ 
ting the animal to return to the choice point û 
making a "bad" choice. | 
Each animal was given 80 trials t 
tending over an 8 day period, viz. 8 
trials. 
An additional 30 anoxic and 30 control previous 
learned the discrimination problem withou TH 


as 


at this task ex- 
blocks © 


animals 


se 
БЕ S : naze. 

experience on the multiple unit T اوم‎ only) 

rrOTS, f 

performances (scored by trials and € the basis 


were used to differentiate the groups oF г 

of transfer of training from the maze lea 

the brightness learning. А 
Most animals completed the learning ай 

learning tasks by the 100th day of age. 

mals were weighed at this time. se group’ 
The schedule of testing with the various 8 

is shown in Table 14. 


ning 


d the T€ 
А ап!" 


Results 


ce 
> »ferene 
The results are presented with ref 


to the following questions concerning 
effects of prenatal exposure to OYE 
ficiency upon learning and тен, іл 
maturity: (a) ability to learn and to 

the solution of a difficult maze problem tal 
relationship between the effects of рге ей 
anoxia on learning and the interval bet? “tb 
the day of deprivation and the day of 
(c) ability to learn a simple brightness is. 
crimination problem, (d) transfer of t pd 
ing from one learning task to a secon’ ей 
(e) inhibition (or facilitation) of ™ pe 
polated learning upon relearning Бе pos; 
original task. Because of the imminen 515 of 
sibility of interaction effects, the analy паї? 
the data also concerned questions of pe e 
“side” effects of prenatal anoxia, that pr 
effect on body weight, sex ratio, and wi 
of performance, specifically, speed in $ 
ming the length of a straight tank. 


1 
f 10 
пе 

to 
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TABLE 14 


PROGRAM ОЕ CONDITIONS OF LEARNING FOR EACH GROUP 


Conditions of 
Group No. Learning 30-Day Interval Relearning 
Control 100 Mult. T Maze (50: rest ) Mult. T Maze 
(50: learn B-W disc.) 
Anoxic 100 Mult. T Maze (50: rest ) Mult. T Maze 
(50: learn B-W disc.) 
Control 30 B-W discrimination 
B-W discrimination 


Anoxic 30 
| 


The effect of prenatal anoxia upon per- 
formance of learning and retention on diffi- 
cult maze problem. The anoxic animals 
made 37% more errors in learning this 
maze to criterion (one errorless trial) than 
did the control group. The mean error score 
for the anoxic group was 226.33 and for the 
control, 164.61. The mean difference 
(61.72) was significant beyond the .01 level. 

There were additional differences between 
the groups as related to the sex of the mem- 
bers. In superiority of performance (few- 
est errors), the control males were best; the 
control females, anoxic males, and the 
anoxic females followed in that order. The 
differences between the control males and 
the control females and between the control 
males and the anoxic males were significant 
beyond the .01 level. The difference be- 
tween the control females and the anoxic 
females was significant beyond the .05 level 


(one-tailed test ; the order of difference was 
as expected). 

The anoxic animals required 30% more 
trials to reach the criterion of mastery than 
did the controls. The mean trial score of 
the anoxic groups was 25.23, whereas the 
corresponding score for the control group 
was 19.35; the difference between these 
means was significant beyond the .01 level. 

As with the error scores, differences in 
trial scores also existed between the groups 
as related to the sex of the members. The 
nature of the differences was about the 
same: the control males were superior (re- 
quired the fewest number of trials); the 
control females, the anoxic males, and the 
anoxic females followed in that order. The 
differences between the two groups of males 
and between the two control groups (male 
and female) were statistically significant 
(.01 level). 


TABLE 15 


ANALYSIS OF Error Data ON 14 Unit Maze 


Group No. Group Mean 
Anoxi 100 226.33 
Contrl 100 164.61 
Anoxic 39 241.85 
Control 48 194.85 
Anoxic 61 216.41 
Control 32 139.69 
Anoxic 61 216.41 
Anoxic 39 241.85 
Control 52 136.69 
Control 48 194.85 


} р 
4.02 <.01 
1.89 <.05 
4.24 <.01 
1.01 >.05 
3.15 <.01 
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TABLE 16 


ANALYSIS OF TRIAL Data ох 14 Unit Maze 


Group No. Group Mean | oMd Sex | t "n P — 
EP ‚01 
Апохїс 100 25.23 | 1.82 = | 3.23 < 
Control 100 19535 | Roth š >.05 
Anoxic 39 27.05 | 2.81 | B | 1.59 
Control 48 22.58 | 4 <.01 
Anoxic 61 24.07 2.31 M | 3.33 
Control 52 16.37 N | >.05 
Anoxic 61 24.07 2.98 M | 0.90 
Апохїс 39 | 27.05 | Е | п <.01 
Control 52 16.37 | 2.30 M 2.70 
Control 48 22.58 Е | m 
n au OL 
- " А : | tions O 
The analysis of the time scores (time per ferences between the two popula 
trial) presents the same picture of inferi- male animals. „кые. multiple 
ority of performance of the experimental Retention of the solution ot peg m fol- 
animals. The anoxic group averaged 1.26 unit maze after an interval of 3 / ndi sig- 
min. per trial as compared to the 1.16 min. lowing original mastery differentia menta 
averaged by the control group. The differ- nificantly, the control and the expe! 
ence between these averages was statistically groups. о only] 
significant (beyond .01 level). Once again, For this comparison of behavior. : 
further di 


fferences were noted when the two 
Sroups were subdivided according to the sex 
of the members. The control males were 
significantly faster than the control females 
(.05 level) and the anoxic males (.01 level). 
As determined by performance 
multiple T maze, the differences betw 
8Toups were consistent for all m 
used: for errors and trials to reach criterion 
and for average time per trial. In each in- 
stance, the control males were superior to 
the control females, anoxic males, and 
anoxic females. In each instance, differ- 
ences between the control and experimental 
animals were attributable, largely, to dif- 


on the 
een the 
easures 


“меб 
half (50) of cach of the two groups Won 
used. (The remaining 50 of each cts 0 
Were used for comparison of the ae 
interpolated learning.) The design а! 
sults are shown in Tables 17 and 18. Е 
In all cases the raw relearning score 
the control group were superior сей 
scores of the anoxic group. The latter tria 
21% more errors and took 14% more тте! 
in relearning the maze than did the ind i 
the control group. Only the pe in 
trial scores were significant statistical у: he 
These results might be a function ° ох 
greater amount of practice for the але 
group in the original learning problem. 


те 


s of 
the 


TABLE 17 


Comparative RETENTION D. 
NORMAL (“Rest”) С 


ATA FOR CONTROL AND ANOXIC ANIMALS UNDER 
ONDITIONS FOR AN INTERVAL oF 30 Days 


(Error Scores) 


Conditions during А 
Group No. Interval Group Mean c Md i p 
Anoxic 50 Best z 05 
Control 50 Ил 30.26 6.06 1.11 


23.52 


na 
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TABLE 18 


COMPARATIVE RETENTION DATA FOR CONTROL AND ANOXIC ANIMALS UNDER 
NORMAL (“REST”) CONDITIONS FOR AN INTERVAL OF 30 Days 


(Trial Scores) 


Conditions during 


Group No. Interval Group Mean c Md 1 РА 
Anoxic 50 Rest 6.50 0.88 d. «.05 
Control 50 Rest 4.94 


anoxic animals made 4696 more errors and 
required 3496 more trials in the original 
learning situation. In any case, savings 
scores for the two groups were almost iden- 
tical (median saving score in trials for con- 
trol group, .80; median for anoxic group, 
78). 

Relationship between the effects of pre- 
natal anoxia on learning and the interval 
between the day of deprivation and the day 
of birth. The data indicate significant dif- 
ferences in the ability to learn the maze as 
a function of the developmental stage at the 
time of oxygen deprivation. A summary of 
this analysis (analysis of variance) is given 
in Table 19. 

A further review of the data revealed, 
however, that the bulk of the differences 
noted between the various experimental sub- 
groups and between the total experimental 
group and the control group was contrib- 
uted by a single experimental subgroup : the 
subgroup born 10 days after exposure ons 
posed, possibly, on the 13th fetal day). «à 
subgroup differed significantly from each 


of the other experimental subgroups and 
from the control group. No differences ex- 
isted among the other experimental sub- 
groups. Їп a further comparison between 
the control and the anoxic animals, with the 
subgroup born 10 days after exposure de- 
leted, significant differences remained al- 
though the level of significance was reduced 
(from .01 to .05).!* 

Ability to learn a simple brightness dis- 
crimination problem. Two groups of 30 ani- 
mals, experimental and control, were com- 
pared to determine the effect of prenatal 
anoxia upon simple black-white discrimina- 
tion learning. Neither group had been used 
previously on any learning problem. 

The control group made more correct re- 
sponses in the block of 80 trials than did 
the experimental. The mean number for 


10 Only the initial learning performance on the 
multiple maze could be evaluated in this fashion. 
Elsewhere the data were too meager to permit a 
subdivision according to fetal stage at time of 
deprivation. 


TABLE 19 


ANALYSIS OF VARIANCE OF Anoxic GROUPS WITH RESPECT TO THE 
INTERVAL BETWEEN Day оғ EXPOSURE AND Day or BIRTH 


Age Groups Measure df Sum of Squares Variance F р 
n Initial Learn. 8 280125.27 35015.66 2.69 <.01 
Wine Errors 91 1183842.84 13009.62 
Between Initial Learn. 8 3654.52 456.82 2.67 <.02 
| Within Trials 91 15557.19 170.96 
| Between Relearning 8 2613.97 326.74 0.53 2.05 
Within Errors 91 62179.87 
Between Relearning 8 165.84 20.73 1.27 2.05 
| Within Trials 91 1506.91 16.38 
i 
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Mean Errors 


1109876543210 
Day of Exposure Prior to Birth 


Fic. 11. Relationship between d 
to oxygen deficiency prior to birth and perform- 
ance on the multiple T maze. For comparative 
Purposes, the two points on the right, Group 1 and 


oup 2, are presented from the first study re- 
ported in this monograph, 


ay of exposure 


the control group was 50.93 and for the 


anoxic group, 47.17, These do not differ 
Significantly, however (р< .05). 

Transfer of training from one learning 
task to а second. Eleven days after they 
had reached the criterion on the multiple 
unit maze, 50 control and 50 anoxic animals 
Were given 80 trials on the black-white dis- 
crimination task in the single unit water 
maze. The animals in both the anoxic and 
control groups, which were to be used on 
the discrimination problem, were selected 


TABLE 20 


ANALYSIS Ор Data on 
Task: INIT 


SIMPLE Discrimination 
TAL LEARNING 


Anoxic 


Mean Trials 
m 
© 


10 
1110 98765432 


Р irth 
Day of Exposure Prior to Bir 


t px posure 
Fig, 12, Relationship between day Perform 
to oxygen deficiency prior to birth am d pur 
ances on multiple T maze. For нд T and 
poses, the two points on the right, s study © 
Group 2, are presented from the first $ 
ported in this monograph. 

‘inal 1612 
prior to the beginning of the origina а was 
ing on the complex maze. The selec 

| 


the 
regarded as random with petere, to the 
learning ability of the animals anc ment 
subgroups related to the develop 
stages at the time of deprivation. 


The results of their performances E А 
second learning problem, measured M © 8 
oi correct responses in the block ont 
trials, are given in Table 21. The ber (а 
animals were superior to the exper! with 
in both situations, that is, in situations, inf 
and without the benefit of previous lea 
experience (see Figure 13). . m 

On the black-white discrimination T an 
lems, after maze experience, the eom j 
the anoxic groups made 51.30 an di 
Correct responses, respectively. This 


fel 


у 
54 
nce? ў 
11 As it turned out, there were po rep” 
the learning on the initial task and EN roup | 
sentation of the several experimental subg 


ANOXIA, BEHAVIORAL DEVELOPMENT, AND LEARNING ABILITY 31 


TABLE 21 


ANALYSIS OF DATA ON SIMPLE DISCRIMINATION Task: INTERPOLATED PROBLEM 


Group Group Mean Sex c Ма 1 P 

(CR) 

Anoxic 45.18 Both 1.65 3.71 «.01 

Control 51.30 Both 

Anoxic 43.56 F 2.08 3.21 «.01 
50.14 F 

| eme 46.80 M 2.50 2.56 «.01 

Control 53.21 M 


ence in discrimination learning, as before, 
favors the control group. This difference 
was significant at the .01 level. It appears 
that the ability to utilize the experience on 
the complex maze in learning of a discrimi- 
nation problem, as well as the ability to 
master the complex maze, initially, was in- 
ferior in the anoxic animals. 

Effects of interpolated learning upon rc- 
learning the original task. The conditions 


х----х Anoxic 


E xx Transfer 
Anoxia 


9 
8 
3 
в 7 
Е 
о 
а 
vo 
È 6 
= 
ЕЧ 
5 5 
S o—— Normal 
5 +----* Transfer 
4 
5 Normal 
Q 
E 
3 
z 
с 
o 
Ф 
> 


5 Ө 8'Э 
DAYS 
(10 trials per day) 


12.8 4 


Fic. 13. Role of transfer of training among the 
control and experimental groups in the learning of 
the simple discrimination tasks. 


of interpolated learning were included in 

the experimental design in order to deter- . 
mine the relative susceptibility of the anoxic 

and the control animals to reproductive 

interference. 

The brightness discrimination problem, 
the interpolated task in the 30-day interval 
between the learning and relearning of the 
multiple unit maze, did not inhibit the re- 
learning performances; instead, both the 
anoxic and the control groups demonstrated 
facilitation on the second exposure to the 
maze. The differences in relearning scores 
between the anoxic "rest" group and the 
anoxic "interpolated" group was significant 
beyond the .05 level. However, a compara- 
ble difference (.05 level) existed between 
the two anoxic groups on the original learn- 
ing of the multiple unit maze. This existed 
despite a random selection of animals prior 
to any learning experience whatsoever. This 
latter difference tends to disclaim the reality 
of any difference between the two groups as 
related to interpolated experience. 

Facilitation instead of inhibition from the 
interpolated learning may be explicable on 
the basis of a number of factors: the rela- 
tive dissimilarity between the maze and the 
discrimination tasks in the specific stimulus 
properties critical to learning and upon 
which interference might depend; the rela- 
tively high similarity between the two learn- 
ing situations in terms of the general condi- 
tions of practice and performance involved, 
on which facilitation might depend ; and the 
mere occurrence of daily exercise, handling. 
or performance, even apart from learning. 
in a similar motivating situation for almost 
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TABLE 22 
EFFECT OF INTERPOLATED LEARNING ON RELEARNING 
(Errors) 
| Conditions during Group | 
Group | №. Interval Means в Md П | р 
| И E — Ux [opem 
Control 50 |  B-W discrimination 17.4 | жов | 12 | 2.08 
Control | 50 | Rest 25.52 | А 
Апохіс 50 B-W discrimination 19.66 5.03 2.11 <.05 
Anoxic | 50 Rest 30.26 | 
| | d 
1 of the 30-day interval which was not frequency magnified for the experimental 


available to the "rest" groups. 


“Side” Effects 


Analysis of weight data. The mean 
weight of the anoxic group as а whole 
differs from the mean weight of the control 
group as a whole only on the basis of 
chance; no significant weight differences 
existed either at 60 or 100 days of age. A 
breakdown of the groups according to sex. 
however, revealed the significantly lighter 
weight of the anoxic females, as compared 
to the control females, at both dates. The 
anoxic and control males did not differ at 
any point. (See Table 24.) 

Sex ratio. The subdivision of groups by 
Sex was as follows: 


Multiple Brightness 
T Maze Discrimination "Total Я 
Male Female Male Female Male Female 
Control 52 as 15 5 0 6 
Anoxic 61 39 20 10 81 49 


Although in most instances there were more 
males than females, with the superiority in 


groups, no significant differences existed. 

Speed of swimming in the straight tank. 
From the any differences т 
swimming speed, one may conclude that 
differences in learning performances of the 
anoxic and control groups cannot be attrib- 
uted to this factor. The mean of the control 
group was 6.43 sec. and, of the anoxic 
group, 6.74 sec. per trial for the block of 
10 trials. (The critical ratio on these data 
of .92 was insignificant.) 

Even though the anoxic females were 
smaller than their control counterparts, as 
indicated by body weight, they were not 
materially affected in this aspect of loco- 
motion. 


absence of 


Discussion 


The primary result of this study is that 
animals exposed to severe oxygen depriva- 
tion for a short period of time during their 
fetal life are impaired at maturity in their 
ability to learn and to retain (after a 30-day 
interval) a complicated maze pattern. The 


TABLE 23 
EFFECT or INTERPOLATED LEARNING ON RELEARNING 
(Trials) 
Conditions during Group 
Group No. Interval Means c Md t р 
Control 50 B-W discrimination 4.22 
і 73 .9 .05 
Control 50 Rest 4.94 NS es М 
Апохіс 50 | B-W discrimination 4.00 0.79 3.16 2.01 
Anoxic 50 | Rest 6.50 


ANOXIA, BEHAVIORAL DEVELOPMENT, AND LEARNING ABILITY 33 


anoxic animals were inferior to the control 
animals in all measures of ability to learn 
and to retain the maze and to learn a dis- 
crimination problem, although in some in- 
stances this inferiority was not a statistic- 
ally significant one. 

The results of the initial learning on the 
14 unit water maze were in the same direc- 
tion as those results reported by Hurder and 
Sanders (1953) and Meier and Bunch 
(1950). In each of those studies, however, 
the deprivation occurred shortly after birth. 
In general, the results of this experiment 
indicate greater differences between the 
anoxic and control groups than in the other 
studies mentioned. It is impossible at this 
time to state the basis of the differences in 
magnitude between this and the other 
studies; it could be the prenatal (especially 
deprivation at a particular. developmental 
stage) versus neonatal conditions ; it could 
be a difference in the severity of the depri- 
vation, per se; and so on. | 

Meier апа Bunch reported that anoxic 
animals do not perform as well, according 
to some measures, as control animals on a 
single T water maze with respect to ко 
of training in learning a position habit. t 
can be added from this study that animals 
ygen prenatally: ( a) do not 
jl as control animals on а 
when ability to dis- 
asured and 


deprived of ox 
perform as well 
single T water maze € 
criminate brightness is being me 


(b) show less ability to profit from previous 
learning of a maze problem. 

The results on the relearning of the maze 
were not consistent with those reported by 
Meier and Bunch (1950), Windle and 
Becker (1943), and others reported in this 
monograph. Meier and Bunch reported that 
learning was relatively unaffected by neo- 
natal deprivation, but that retention follow- 
ing 2 or 14 days was markedly altered. This 
study found that original learning is much 
more severely impaired than is retention. 
Retention here was measured by relearning 
(Meier & Bunch used both relearning and 
reversal of habit), which was started 30 
days after the completion of the original 
learning. It is possible that the presence of 
significant differences may be a function of 
the length of the interval over which reten- 
tion was measured. 

A factor of unknown importance was the 
difference between the groups when sub- 
divided by sex. Although not significant, a 
difference did exist in the frequency distri- 
bution of females and males between the 
control and experimental groups. In every 
instance where a comparison was possible, 
the males considerably outnumbered the fe- 
males in the anoxic groups, but were almost 
equal in number in the control groups. 
Along with this, in each instance where 
significant differences were found between 
the experimental and control groups, the 


TABLE 24 
ANALYSIS OF WEIGHT DATA 
+ c Group Means c Md t p 
Group No. í ме, = (Каш) 
day: 
eri 1 167.58 5.36 .25 >.05 
Anoxic 74 60 Both jos 2^ 1.25 
` 1 83 60 Both Ld 
Contro А 137.94 3.45 4.03 <.01 
Anoxic i ad Е 151.83 | 
Control 38 60 M 195.66 5.10 | 0.28 >.05 
Anoxic p 60 M 197.24 
Control | ks | 100 Both 245.99 658 | 0.02 >.05 
Anoxic | Both 246.11 
Сига 100 | Шы F 105.28 | 4.98 2.91 <.01 
Anoxic | 100 Е 209.93 а 
Control 3 | ao | M 278.60 6.41 0.10 2.05 
noxic 6l 100 | M 219.21 
Ontrol 52 س‎ 


ج ا ت 
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differences were largely explicable by the 
differences between the two populations 
( control and experimental) of males. This, 
however, was in spite of the consistently 
inferior performances of the anoxic females 
as compared to the other groups. 

Although the anoxic females were sig- 
nificantly lighter than the control females 
(the only significant weight difference), the 
measures which would be expected to show 
consequential differences in behavior failed 
to do so. No differences were found in the 
swimming speeds in the straight tank. Dif- 
ferences in physical dimensions seemingly 
played an insignificant part in the explana- 
tion of differences in behavior. 

In the light of these facts, there is a pos- 
sibility that the learning data presented in 
this study were more favorable to the anoxic 
group than the differences discussed above 
would indicate. Possibly, one group, the fe- 
males, was selected against. This group, as 
suggested by the nature of the behaviors 
found here, would have been even more 
drastically affected had a comparable ratio 
of females to males been attained for the 
anoxic group as was attained for the con- 
trol group. Such an equitable distribution 
by sex, in all likelihood, would have made 
differences between the anoxic and control 
groups even greater. 


The differences between the several ex- 
perimental subgroups (subdivided by fetal 
age at the time of deprivation) are difficult 
to evaluate at this time. If one could make 
the necessary adjustments for species, de- 
velopmental rates, severity of deprivation, 
and the time and nature of measurement, 
these differences might be found to be of 
the same basis as those described by Meier 
and Menzel (1955) in chicks. In their 
study, oxygen deprivation (LD;,) at the 
end of 8 days of incubation (21 days is 
"full term") was particularly effective in 
producing behavioral dysfunctions. The 
concepts offered by them in explanation of 
the results dealt with “critical period." In 
this usage, the concepts had been general- 
ized from the critical periods in the growth 
of tissues and organs to critical periods in 
the growth of mature behavior patterns. 


Several incidental, unquantified observa- 
tions should also be added to the description 
of the experimental animals in this study. 
Гог the most part, none of the gross phys- 
ical abnormalities reported by Ingalls 
(1947а, 1947b) in his experiments with 
mice were found with the rats in this study. 
It was noted, however, that 5 anoxic ani- 
mals (of the 180 born) were blind and, 
consequently, were not used in the behavior 
measures. 

Certain behavioral “peculiarities” were 
observed in the experimental animals. They 
tended to be more “skittish” and fractious 
than the controls. In their mastery of the 
several learning measures used, they tended 
to be more variable (this was reflected in 
the la group variances). These char- 
acteristics were similar to those observed in 
the study on neonatal anoxia in cats, re- 
ported by Meier (1953). Plausibly, these 
behaviors can explain differences in learn- 
ing ability, especially since some of the 
differences tended to disappear as the ani- 
mal had more experience in learning situa- 
tions. That is, a deficit in learning ability 
may not be operative in these studies so 
much as a factor of heightened emotion- 


ality. 


Summary and Conclusions 


This experiment was conducted to meas- 
ure the effects of prenatal anoxia on maze 
learning ability in rats. Forty pregnant rats 
were exposed in а decompression chamber 
at an altitude of 30,000 ft. (6.2166 equiva- 
lent oxygen) for a period of 2 hrs, The 
rate of ascent and descent was 3,000 ft. per 
min. A total of 130 offspring of the ex- 
posed females were used in this study as 
experimental Ss, and their performance ой 
two different maze problems was comparet 
to the performance of 130 offspring of 4 
control animals. 

The maze problems consisted of a 14 unit 
modified Stone T water maze on which con^ 
trol and experimental animals were require 
to reach a criterion of one errorless trial 
and a single unit T water maze on whic? 
number of correct responses in 80 learning 
trials was the measure obtained. 
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Data were also procured for: retention as 
measured by relearning after a 30-day in- 
terval; transfer of training from the 14 unit 
T maze to the black-white discrimination 
problem on the single T maze; and the 
effects of learning an interpolated problem 
(the black-white discrimination problem) 
on relearning the original problem. 

In addition to the above data, other vari- 
ables considered were: weight of control 
and experimental animals at 60 and 100 
days of age, speed of swimming in the two 
groups as measured by a straight water 
tank, the number of days before birth the 
animals were exposed to oxygen de- 


anoxic € › { 
physiological disturb- 


ficiency, and possible 
ances caused by anoxia. 

The following are conclusions based on 
the data obtained : 

1. Prenatal anoxia severely affects an 
animal's ability to learn а 14 unit T maze. 
The animals deprived of oxygen for a short 
period during fetal life perform at maturity 
in an inferior manner to a control group in 
trials, and time. The dif- 


terms of errors, me. 
und to be significant be- 


ferences were fo 
vond the .01 level. | 
` 2. Retention as measured by trials to re- 
learn the 14 unit water T maze was signifi- 
cantly different between the control and ex- 
perimental groups. 

3. Anoxic animals which made the poor- 
est learning records were those whose 
period of oxygen deprivation preceded birth 
by approximately 10 days. | 

“4, The learning оп а simple black-white 
discrimination problem on a single T water 
maze significantly differentiated the learning 
abilities of 30 anoxic and 30 control 
animals. 

5. Positive transfer from the original 
ng problem to the simple black-white 
ion problem was slightly but sig- 
controls than in the 


learnit 
diseriminat 
nificantly greater in the 
anoxic animals. 

6. The black- 
lem, when interpol 
learning and relearnms ^ 
maze, led to reproductive ! 
groups. The anoxic anima 


white discrimination prob- 
ated between original 
g of the 14 unit T 
acilitation in both 
15 appeared to 
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profit more from the interpolated learning 
than did the control group. 

7. Generally, observable physical deform- 
ities were at a minimum. The only recog- 
nized deformity was blindness in five anoxic 
animals. 

8. Female anoxic animals weighed sig- 
nificantly less than did female control ani- 
mals; anoxic males did not differ from 
control males, even though the anoxic males 
were primarily responsible for significant 
differences between control and experi- 
mental groups on measures of behavior. 

9. Speed of swimming in a straight tank 
was not seriously affected by prenatal 
oxygen deprivation. 

10. The data do indicate that impairment 
in learning ability is a function of prenatal 
exposure to oxygen deficiency. 


Histological Analysis 


Following the completion of the maze 
tests, a limited sample of the animals was 
sacrificed and the brains prepared for his- 
tological analysis. The selection of the ani- 
mals was made on the basis of their per- 
formances on the multiple unit maze. This 
sample included the two best and the four 
poorest performers of the control group and 
the four best and the six poorest perform- 
ers of the experimental group (without re- 
gard to fetal age at the time of oxygen 
deprivation). In all, 16 Ss were evaluated 
in this fashion. 

Tn the data obtained. no significant histo- 
logical trend, as related to prenatal anoxia, 
was apparent. No qualitative differences 
(size and shape of cells, etc.) could be seen 
between the control and experimental ani- 
mals. While there were considerable varia- 
tions in thickness of the cortex of the 
cerebrum and in the relative cell counts in 
the same structure, these seemed to be due, 
largely, to variations within the individual 
animal and in no way correlated with per- 
formance on the multiple unit maze. 

Should prenatal anoxia cause cell anom- 
alies or death, these techniques on this 
selected sample did not show them—cer- 
tainly, these techniques did not show them 
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to be functionally related to the measured 
behavior of the Ss. 

(For a more extensive discussion of the 
techniques used, their rationale, and the 
results gained, see Appendix B.) 


Part II 

Two questions raised by the first half of 
this study made a replication of the experi- 
ment, in part, mandatory. (а) Since the 
dates of conception were not known, the 
fetal ages at the time of oxygen deprivation 

: yS | 
could only be roughly estimated. Moreover, 
these estimates, based on the number oí 
days prior to birth the fetuses were de- 
prived, seemed risky, certainly questionable. 
Can this form of insult, an exposure to de- 
compression with the attendant lack of 
oxygen, accelerate or, even, decelerate, pre- 
natal development and alter, thereby, the time 
of parturition? (b) The maximal effect of 
the deprivation was upon those animals 
born 10 days after the insult. The effect 
upon the young born 9 days or less or 11 
days or more after the insult was markedly 
less obvious. Could this in reality be a 
Ге ere al а 7 
“critical period atw hich time the develop- 
ing organism „1S uniquely sensitive to 
munis in available oxygen? Could there 
e other critical periods at earlier fetal 
as г M 5 f р 

pe ( Apparently fetal ages less than 11— 
£ days were not sampled in the first part 
ie this study.) These matters formed the 
2515 of a replication in which the fetal 
ages tested would represent the entire period 
of gestation, 
Subjects 
wentcording to plan, 66 pregnant female rats 
d to be used in this part of the study, Each 
i the days oí gestation, from 1—22, were to be 
no Md by 3 of the pregnant rats. Бао 
ately, this plan met with some difficulty ; only A 
Pregnant Wistar rats were received from a px 
Cer at the appointed time, This meant fully 
several of the fetal age groups were not fully 
represented (see Table 26). (The Е aa 
a Vaginal plug was accepted. by the pee ег A 
Positive indication of mating behavior.) eet 
Of the females of the group which had heen bre 
days prior 10 their receipt in fits: laboratory 
dropped their young within minutes of the sched- 
uled time of the oxygen deprivation. Consequently, 

ly ats 22, intervals in the fetal de- 
only 21, rather than 22, 1 k 
velopment were examined in this study. 


-Ipparatus and Procedure 


Deprivation. Forty-one of the pregnant females, 
2 from each of the 21 groups of 3, were placed in 
а large decompression chamber. ("There were ex- 
ceptions; these are indicated in Table 25, Where 
both the control and experimental groups could 
not be included in a fetal age group, because of 
small size, only the experimental group was repre- 
sented. As а result of this, there were 41 experi- 
mental pregnant animals and 19 control.) The 
pressure within the chamber was reduced until a 
simulated altitude of 30,000 ft. (8.88 т. Hg. ; 6.21% 
equivalent oxygen) was reached. This simulated 
altitude was reached in 15 min. The animals were 
maintained under these conditions Гог 120 min. 
after which time the pressure was returned tO 
normal, This, too, required 15 min. All in all 
they were under some degree of decompression 
and oxygen deprivation Гог 150 min. 

The females were placed in individual Б апе 
the respective dates of delivery of their litters 
were recorded, d 

Of the 254 young horn to the experimental 1€ 
males and the 124 born to the controls, 108 were 
selected for testing purposes. Since it was recog- 
nized that litter differences might be a major con- 
founding factor in this study (with the smal 
number of litters at each fetal age), the experi 
mental offspring were divided into five groups 
These groups, with their numerical size, were: 


s and 


Group I: Deprived 2-5 days prior to birth: 
14 5s. 

Group Il: 
14 Ss, 

Group III: Deprived 10 
19 Ss, 

Group IV: 
20 Ss. 

Group V: 
16 Ss. — 

Control Group: 25 Ss. 
( Originally, it was planned that there be but M 55 
in each of the 5 experimental groups and 25 in the 
control group. As the running of the Ns on the 
maze was more rapid than anticipated, the size 0! 
the last three experimental groups was increased.) 
The Ss for th groups were selected with the 
view of representing as many individual litters A5 
possible. No consideration was given to the sex 
ratios of the several groups. In all, there were 
83 Ss from the asphyxiated mothers and 25 front 
the control used for testing purposes. 

Behavior measures. The 108 animals were per 
mitted to mature under normal laboratory cond!” 
tions until the 64th—67th days at which time the? 
were pretrained on the straight swimway used 20 
the first half of this study. After 10 trials on BE 
swimway (completed within a single session) th 
animals were run on the multiple unit water m^ 
(also used in the first portion of the study) ut 
they reached the criterion of one errorless РСТ 


Deprived 6-9 days prior to birth : 


13 days prior to birth: 
—17 days prior to birth: 


Deprived 14 


Deprived 17-21 days prior to birth: 
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formance. On this apparatus the Ss were given 
3 trials each day. Besides trials to criterion, time 
and error scores were recorded. 


Results 


[ital statistics. Of the 41 pregnant ani- 
mals exposed to the deprivation conditions, 
3 died during or immediately following ex- 
posure. Two more died within 5 days of the 
exposure. There seemed to be no pattern 
in these deaths; deaths occurred within the 
entire range of gestation ages (see Table 
25). (No control females died within this 
period.) 

Only 44 litters were delivered; 30 to the 
experimental group and 14 to the control. 
Again, no pattern can be found in these 
failures; in fact, the notion that some of the 
experimental animals could have aborted 
does not seem to be tenable, since the num- 


ber of failures for the two groups are al- 
most the same (5, control; 6, experimental). 

Although there was a difference in the 
average length of the gestation periods in 
favor of the control group (0.42 days), this 
was not found to be significant. (See Table 
26.) More than likely, this difference is 
considerably less than the errors inherent in 
the setting of the date of mating and the 
date of delivery. A cursory inspection of 
Table 28 will reveal that only one delivery 
seemed to have an unusually short gesta- 
tion period and only two longer (consider- 
ably) than usual Other than these, the 
control and the experimental groups were 
very much alike. 

Again, although there were differences in 
litter size at birth and at 65 days of age in 
favor of the control group, statistical sig- 
nificance of these differences was lacking. 


TABLE 25 


Vira STATISTICS: І 
(Breeding and delivery data on the control (C) and experimental (X) females) 


No No. of Litters No. of Deaths Failures to Drop 
Mating in 
Date Group с X @ X С X 
8-1 1 1 
8-2 3 1 А 
oS 3 1 1 1 
8-4 3 ! 1 
8-5 = 1 1 
8-6 
3 2 1 
8-7 
8-8 3 1 2 
8-9 3 1 2 
8-10 3 1 2 
8-11 3 1 1 1 
8-12 3 1 1* 1 
8-13 2 2 1 
8-14 3 1 Ц й 
8-15 3 1 A 
E 3 1 2 
8-16 
3 1 2 
8-17 1 2 
8-18 d 2 1 
8-19 > 1* 1 1 
8-20 1 1 1 
8-21 3 1 2 
8322 3 
eR j 30 5 5 
Totals 60 и ° 


* Died during the oxygen deprivation. The o 


thers died within 5 days of exposure. 
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TABLE 26 


ViTAL Statistics: И 


Litter and Viability Data with Respect to Dates of Mating and Parturition 


Length of | | No. Dead | No. Alive 
Birth Breeding | Gestation No. No. Dead | at65 at 65 Dispen- 
Date Date (days) Group | Born at Birth | Days Days sation 
8-23 8-1 22 C 14 | 14 6 to € 
8-25 8-2 23 P 9 9 5101 
8-2 23 [e 9 1 1 7 
8-3 22 x 5 1 1 3 1001 
8-26 8-2 24 Ж 8 4 3 1 
8-3 23 x 3 3 Е 
8-3 23 С 9 1 8 210€ 
8-28 8-4 24 x 7 3 4 2 tol 
84 24 © 12 12 4 to C 
8-5 23 X 6 6 3tol 
8-6 22 x 6 6 3 tol 
8-30 8-7 23 x 13 1 2 10 4 tol 
8-7 23 X 11 10 1 8 
8-8 22 C 3 1 2 1 to C 
8-8 22 bid 6 6 
8-8 22 x 11 1 1 10 4 to IT 
8-31 8-9 22 x 10 10 6 to Il 
8-9 22 x 11 8 3 6102 
9-1 8-9 23 С 9 1 1 7 2 to C 
9-2 8-10 23 С 3 2 1 
9-3 8-10 24 X 12 1 2 : 6 to Ш 
8-11 23 x 10 E 
9-4 8-12 23 С 10 5 5 2 to C 
8-15 20 x 14 2 1 11 5 to Ш 
8-13 22 X 11 2 9 6 to Ш 
9-5 8-13 23 х 7 1 еа 
8-14 22 C 10 10 3toC 
9-6 8-10 21 x 7 3 4 3 to IV 
8-14 23 x 11 5 3 3 3 to IV 
8-15 22 С 6 6 
9-7 8-15 23 26 11 2 9 5 to IV 
9-8 8-17 22 € 8 6 2 
9-9 8-16 24 x 3 1 2 1 to IV 
8-16 24 X 13 1 12 8 to IV 
8-17 23 x 8 1 7 
8-18 22 С 10 2 $ 
9-10 8-16 25 G 9 9 3to C 
у 8-17 24 х 3 3 
8-18 23 x 9 1 1 7 2to V 
9-11 8-19 23 x 5 5 5 to 
9-12 8-19 24 X 4 4 Е 
9-13 8-18 26 X 13 1 12 5 to 
8-21 23 X 7 2 1 4 4 to 
8-22 22 € 9 2 7 2 to 
Number (control) 121 6 32 83 
Number (experimental) 254 28 66 160 
Mean (control) 8.64 7.54 
Mean (experimental) 8.47 6.40 
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Any differences between the two groups in 
the matter of litter size was attributable to 
difference in the litters at the time of birth. 
Further, there was no trend in the size of 
the litter and the gestation age at the time 
of deprivation. 

Та summary, with the procedures used in 
which the breeding and deliveries were 
checked regularly once every 24 hrs., no 
differences could be detected between the 
control and the experimental groups of ani- 
mals. Possibly, more frequent inspections 
could have shown differences between the 
two groups. 

Behavioral measures: Pretraining and 
maze performance. The performances of 
the animals were analyzed first by treat- 
ment, that is, by age at the time of depriva- 
tion, then by sex. As on most of the crite- 
rion measures the data were nonnormal ; 
these analyses were completed with the use 
of the Kruskal-Wallis (Siegel. 1956, pp. 
184-193) one-way analysis of variance. 
This particular analysis utilizes the data by 
ranks rather than by the actual raw data. 
(In these instances there was little loss of 
statistical power as compared to the usual F 
tests. Were the assumptions of F met, the 
asymptotic efficiency of the Kruskal-Wallis 
would be 95.5.) 


The median time 


scores for the 10 trials 


on the straight swimway Were not statistic- 
ally significant whether analysed by age at 
deprivation or by sex. The median scores 


Pon б ПЕ Ш Iv ¥ Totals 
Males 484 514 63.9 500 51.3 63.5 543 
Females 52.4 538 72.0 51.3 534 498 547 
Totals 50.3 52.7 674 50.6 52.6 583 


The variance M for treatments was 3.47; 
for sex, 0.00. (Гог significance at the .05 
level, [7 would have to equal or exceed 
12.83 for treatments and 5.02, for sex.) 

үле itself, seven different crite- 
evaluated. These were: 
on, (b) errors to crite- 
trial, (d) errors on the 
to criterion, (f) 
ime on the first 


On the ma 
rion measures Were 
(a) trials to criteri 
rion, (c) errors per th 
first three trials, (е) time 
time per trial, and (g) t 
three trials. 


Trials to criterion: no differences were 
found between groups on the number of 
trials required to reach the criterion of 
mastery (one errorless performance). The 
median performances were: 


€ 1 II II у ЗЕ 
Males 55.9 446 49.6 473 487 602 51.8 
Females 51.4 67.1 69.0 49.9 480 733 575 
Totals 53.7 574 580 485 48.3 65.1 


Both the H for treatment, 3.60, and the H 
for sex, 0.92, were clearly below the re- 
quired values (12.83 and 5.02, respectively) 
for significance at the .05 level. 

Errors to criterion: although the various 
treatment groups could not be differentiated 
on this measure of performance, the males 
and females were found to be significantly 
different (р < .05) on the number of 
errors made in reaching mastery. The males 
were vastly superior to the females in all 
treatment groups. F 


с r Jt NL. Y ios 
Males 382 478 57.5 471 309 422 475 
Females 59.8 62.1 86.2 592 504 687 624 
Totals 59.0 560 698 528 406 521 


H for treatment was 7.95; for sex, 6.09. 
(For significance at the .05 level this latter 
value must equal or exceed 5.02.) 

Errors per trial: again, no differences 
were found between groups due to treat- 
ment on the basis of the ratio of total errors 
to total trials to reach mastery for each 5. 
Nor were differences found in performance 
due to sex. 


C I IT Ш IV V Totais 
Males 562 634 67.9 568 317 364 509 
Females 63.9 48.6 65.1 677 618 352 585 
Totals 59.9 549 66.7 620 467 35.9 


Н for treatments was 10.8; for sex, 1.57. 


Errors on the first three trials: since one 
of the Ss mastered the maze in three trials, 
the data for all were analyzed through 
this point. By this we hoped to get a com- 
mon base line for all Ss unbiased by “ob- 
vious relationships" as would be expected 
to occur in the derived measures of be- 
havior, e.g., errors per trial, time per trial. 
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The groups did differ significantly (р < 
05) by treatments on the number of errors 
made in the first three trials. No differences 
were shown in behavior attributable to sex. 

€ I П ПЕ ТУ V Totals 


47.0 375 429 514 


Males 51.8 71.3 69.3 
Females 48.6 67.0 82.2 63.9 49.9 472 580 
Totals 49.8 688 748 550 437 445 


H for treatments was 13.41; for sex, 1.19. 
(For significance at the .05 level, // must 
equal or exceed 12.83.) 

Time to criterion: again, the groups dif- 
fered significantly (р < .05) by treatments, 
now in the total time required for the 
mastery of the task. Again, no differences 
in performance were found attributable to 
sex differences. 


€ I и HI IV.  V Totals 
Males 65.5 57.8 53.1 343 37.3 506 49.9 
Females 56.3 745 812 509 393 720 596 
Totals 611 673 65.1 422 383 58.6 


Н ior treatments was 13.65; Гог sex, 2.60. 

Time per trial: as might be expected from 
the significant differences in total time and 
the lack of significance in total trials (al- 
though the two trends were the same), а 
derived measure based upon these two also 
pointed to significant differences (р < .01) 
between the groups by treatment. No dif- 
ferences in performance were related to 
differences in sex. 


C I ПП ту V Totals 
Males 674 71.8 628 587 431 452 495 
Females 64.3 74.5 777 357 292 360 600 
"Totals 65.9 73.4 69.1 46.6 361 398 


Н for treatments was 23.06; for sex, 3.03. 
(For significance at the .01 level, H for 
treatments must equal or exceed 16.75.) 

Time on the first three trials: the groups 
as classified by dates of delivery after the 
period of oxygen deprivation, differed sig- 
nificantly (р < .02) on the times to com- 
plete the first three trials. Once more, no 
differences in time scores were attributable 
to differences in sex. 


C I H ш IV ут 
Males 65.7 721 674 35.6 460 490 549 
Females 514 783 686 494 421 394 54.0 
588 756 079 42 1 440 454 


Totals 


Н for treatments was 16.00; for sex, 0.02. 
(For significance at the .02 level, М for 
treatments must equal or exceed. 15.09.) 


Discussion 


On the face of things, the data on dura- 
tion of gestation justifies an extrapolation 
of the gestation age at the time of depriva- 
tion from the number of days which this 
deprivation preceded birth, as might have 
been done in the first part of the study. 
However, there are several qualifications 
that must be considered before concluding 
that this bout of oxygen deprivation has по 
effect on the duration of gestation. The first 
deals with the procedures used in this study 
for the determination of the data of breed- 
ing and delivery. Possibly, a closer check 
could have been made on the time of mat- 
ing. From the breeder, we learn that his 
procedures entailed а maximum error of 
12 hrs. Certainly, a more frequent check 
could have been made in the designation 0 
the date of delivery. Routine 12 hr., rather 
than 24 hr., observations would have been 
feasible. In all, more frequent observations 
could have materially reduced the proce- 
dural variance which in this instance was so 
much larger than the group differences. 

Second, work in the Vanderbilt Labo- . 
ratory has shown a regular variation in the 
date (and hour) of hatching of chicks 
which had been exposed to an acute oxygen 
deficiency during incubation. We have 
found that a median lethal deficiency at the 
end of the 8th day of incubation (full incu 
bation: 21 days) will delay hatching by an 
average of 12 hrs., whereas deprivation 2 
the end of the 19th day will accelerate 
hatching by 6 hrs. (Again, chicks аге 
uniquely suited to this type of research; 
here, because the determination of date © 
“birth” is both easier and more meaning 
for the individual S.) 

Although significant differences were de 
tected among treatment groups only on thé 
last four measures, their pattern of рег 
formance was consistent on all seven. The 
poorest performances were observed in ап” 
mals born 6-9 days following oxygen дер! 
vation (gestation age: 13-16 days). 
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best performances were observed in those 
animals born 14—17 days following depriva- 
tion (gestation age: 5-8 days). In broad 
outline, these data in part 2 are consistent 
with those of part 1. 

In some respects, however, the notion of 
critical periods was not materially strength- 
ened. A cursory observation of the group 
trends, both on separate and pooled meas- 
ures, reveals an increased susceptibility to 
impairment by this insult as the animals 
mature to parturition. Nevertheless, it 
should be pointed out that the performances 
of the animals deprived very early in gesta- 
tion were as different from the controls as 
those of the animals deprived very late. On 
most measures, the control group stood in 
between these two age groups. 

Conceivably, some factor could be af- 
fected in early gestation which would tend 
to "improve" the Ss' behaviors, as measured 
here. (At least, the time scores were short- 
ened, the error scores were kept to а mini- 
mum, etc.) Likewise, some factor could be 
affected in middle and late gestation which 
would tend to “downgrade” the Ss’ be- 
haviors, again, at least as measured here. 
Given the two separate functions for these 
factors, one could justifiably speak of two 
critical periods centering about the two 


phases of the animals’ prenatal develop- 


ment. 


Summary and Conclusions 


This replication was made to check on 
the effect of oxygen deprivation on the 
length of gestation and the possibility of 
"critical periods" earlier in gestation than 
noted in the earlier part of this study. 

Forty-one pregnant Wistar rats were ex- 
posed to a simulated altitude of 30,000 ft. 
for 120 min. These animals represented, in 
groups of two (with irregularities, some- 
times less), 21 stages in the fetal develop- 
ment of the rat. Following exposure, the 
animals were placed in individual cages and 
the dates of delivery oer mn 

rogeny of 19 pregnant anima 
Базы абала] development were 


the controls. 


From the data collected, no differences 


would be ascribed to the experimental and 
control groups with reference to duration of 
gestation, size of litter at birth, and size of 
litter at time of running on the behavior 
measures. 

At the age of 64—67 days the animals 
were run 10 trials on a straight swimway 
following which they were run on a 14 unit 
water maze to the criterion of one errorless 
trial. On the basis of the performances on 
these apparatus we can conclude: 

l. There were no differences in median 
time scores for the 10 trials of the straight 
swimway. This might be construed to mean 
that there were no differences in activity 
levels between the experimental and control 
groups. 

2. No differences were noted between the 
groups on the number of trials or errors to 
reach the criterion of one errorless per- 
formance. Further, no differences were 
found in the number of errors per trial 
made by the two groups of Ss. 

3. Significant differences were found in 
the time measures of behavior, the time to 
criterion (р < .05), and the time per trial 
(p < 01). 

4. As one S (control) mastered the maze 
in three trials, analyses of the behaviors of 
all of the Ss were made through this point. 
There were significant differences on both 
the number of errors made in these three 
trials (р < .05) and in the time required 
for completion of these trials (р < .02). 

5. Consistently, Group II made the poor- 
est performances. This group was com- 
prised of animals born 6-9 days following 
the oxygen deprivation (gestation age: 13- 
16 days). This tends to confirm the findings 
of part 1 of this study. 

6. Consistently, Group IV made the best 
performances. This group was comprised 
of animals born 14-17 days following the 
deprivation. (Gestation age: 5-8 days.) 
Conceivably, the performances of this group 
and of Group II indicate the existence of 
two unique “critical periods.” As measured 
in this study, an insult in one period may 
have a facilitative effect whereas an insult 
in the other may have a deleterious effect 


on maze learning. 
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SUMMARY AND CONCLUSIONS 


The three experiments reported in the 
present study are primarily concerned with 
the effects of natal and prenatal anoxia 
upon subsequent behavior, and upon learn- 
ing and retention at maturity in rats and in 
cats. 


Results are reported on the effects of a 
single exposure to anoxia, at birth in ex- 
periment 1 and at different stages during 
the prenatal period in experiment 3, on the 
ability of white rats at maturity (а) to learn 
a difficult and reliable maze problem (14 
unit multiple T pattern), (b) to utilize the 

previous learning of the maze problem in 
the later learning of a simple discrimination 
task (transfer of training), (c) to retain the 
maze habit over an interval of 30 days 
under normal conditions during the interval, 
(d) to retain the maze habit over an inter- 
val of 30 days under conditions designed to 
produce retroactive inhibition, (e) to ex- 
tinguish a bar pressing response recently 
acquired, and (f) to reverse a simple posi- 
tion habit shortly after its mastery. In all, 
data are reported on approximately 474 
white rats in the learning and retention 
studies. Approximately one half were con- 
trol and one half were anoxic animals. 

Behavior data are presented in experi- 
ment 2 on cats showing the effects of а 
single exposure to anoxia at birth upon 
their ability to learn, as measured in the 
following battery of tasks: Guthrie puzzle 
box, single unit T maze, double alternation 
problem, and a brightness discrimination 
problem. Also investigated, but without 
positive results, were the effects of natal 
anoxia upon adult EEG patterns in the cat 
and the influence of prenatal anoxia upon 
gestation period and upon structural brain 
damage in adulthood in the rat. 

The experiments have shown the follow- 
ing results: 


1. Natal Anoxia in Rats 


a. Thirty minutes of oxygen deprivation 
(2.9196 equivalent oxygen) following birth 
had no significant effect upon learning or 
retention in rats of a difficult maze problem 
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nor upon the extinction of a bar pressing 
habit. 

b. Sixty minutes of oxygen deprivation 
under similar conditions, on the other hand, 
produced significantly inferior learning and 
retention of the multiple unit maze, in terms 
both of number of errors made and number 
of trials required to reach the criterion of 
mastery. 

c. Sixty minutes of the severe oxygen de- 
privation did not significantly affect the 
learning of a simple position response but 
it did have a deleterious effect upon the 

reversal of that habit. 


2. Natal Anoxia in Cats 

a. Rapidity of response in the puzzle box 
was significantly decreased. 

b. Poorly adapted responses, such as 
“stereotypy,” were significantly increased 
in certain discrimination tasks: learning а 
position response rather than the required 
discrimination. 

c. А reduced ability to sustain a symbolic 
solution once the solution had been reached. 

d. There was no indication that oxygen 
deprivation affected the ability to learn a 
brightness discrimination where there was 
close spatial relationship and temporal prox- 
imity of response between instrumental ob- 


jects and reward. 


3. Prenatal Anoxia т Rats 


In Part I, in which 40 pregnant rats were 
exposed to 6.2196 equivalent oxygen for 
two hrs. and 130 offspring were compared 
with 130 offspring of 40 control mothers, 
the effects of prenatal anoxia upon learning. 
retention, and transfer were as follows: 

a. The animals deprived of oxygen dur- 
ing fetal life performed in a significantly in- 
ferior manner to control animals in terms of 
errors, trials, and time required to learn the 
multiple unit maze. 

b. Retention of the maze problem, as 
measured by trials to relearn, was signifi- 
cantly impaired. 

c. Learning the simple black-white dis- 
crimination problem was significantly 
poorer in the anoxic as compared with the 
control animals. 
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d. Slight but significantly less positive 
transfer was shown by the anoxic than by 
the control animals from the maze problem 
to the black-white discrimination problem. 

е. Anoxic animals which made the poor- 
est learning records were those whose 
period of oxygen deprivation preceded birth 

by approximately 10 days. 

f. Retroactive inhibition in the retention 
of the maze problem was not produced by 
the interpolated discrimination learning, in 
either the anoxic or control animals. 

g. Observable physical deformities were 
at a minimum; the female anoxic animals 
weighed significantly less than did female 
control animals. 

Part II replicated Part I with the modi- 
fication that the single exposure of pregnant 
females of 6.21% equivalent oxygen fora 
period of two hrs. was studied at 21 differ- 
ent stages in the gestation period. The com- 
parison of the prenatal anoxic group with 


control animals showed the following 


results : 
1. There were significant differences be- 


tween anoxic and control animals indicating 
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superiority of the latter in the following 
measures in learning the maze problem: the 
time to criterion, the time per trial, and 
number of errors made on the first three 
trials. There were no effects of anoxia in 
the number of trials or errors made in 
learning the problem to mastery. 

2. The gestation age at which oxygen de- 
privation occurred was a significant factor 
with respect to maze learning ability. Ani- 
mals born 6-9 days following deprivation 
(gestation age: 13-16 days) consistently 
made the poorest performance. Animals 
born 14-17 days following deprivation 
(gestation age: 5-8 days) consistently made 
the best performance. 

3. Vital Statistics: of the 41 pregnant 
animals exposed to the deprivation condi- 
tions, three died during or immediately fol- 
lowing exposure and two more died within 
five days of the exposure. Mortality did not 
seem to be related to a particular gestation 
age at time of deprivation. No control fe- 
males died during this period. 

4. No differences were found between 
anoxic and control groups regarding dura- 
tion of gestation period. 
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APPENDIX A 


ELECTROENCEPHALOGRAPHIC DATA 


Even though the analysis of electroencephalo- 

graphic frequencies and amplitudes for differences 
in the normal and natally anoxic groups of cats 
proved negative, certain regularities appeared in 
both groups as related to the development of elec- 
trical activity of the brain. 
_ The frequencies of the EEGs through matura- 
tion typically increased until the third or fourth 
month when the adult level was reached. At birth 
the dominant frequency was less than 1 cycle per 
second (cps) with occasional “runs” of greater 
frequencies. At the end of the first month, 3-5 
cps activity was typical; at the end of the second 
month, 5-7 cps; and at the end of the third or 
fourth month, 6-9 cps. These last frequencies 
were those found at the end of the sixth month 
and, again, at the final recording: 7-9 cps activity 
was representative of the frontal leads and 7-8 cps 
of the occipital. 

This listing of “dominant” frequencies should 
not be construed to mean that these were the only 
frequencies seen at the particular recording. The 
spectra of frequencies were much the same for all 
ages, that is, both slow (1-2 cps) and fast (10-15 
cps) appeared in the records at all ages. The fre- 
quencies listed as dominant are those which ap- 
peared most frequently in the record of a given S 
for a given age. This does not imply, further, 
that these frequencies constituted necessarily the 
majority of the record. 

„Dominance appeared in the occipital leads begin- 
ning with the third recording (end of the second 


month). Almost universally, the left occipital 
records had greater amplitudes than the right. 

As indicated carlier (cf. Apparatus and Proce- 
dure) the clectroencephalographic recordings at tlie 
time of deprivation demonstrated a pattern о! 
activity correlated with the severity and duration 
of anoxemia. Prior to the deprivation the fre- 
quencies were typically less than 1 cps within fre- 
quent sequencies of 3-5 cps activity. During the 
earlier stages of deprivation, about the stage when 
the equivalent oxygen in the chamber was reduced 
by half, the frequencies increased and regular se- 
quences of 3-5 cps activity prevailed. This was 
concurrent with increased motor activity of the 5; 
the increased electrical activity appeared to be 
causally related to motor action. After this point, 
the frequency and amplitude diminished continu- 
ously until the deprivation bout was concluded. 
From many of the Ss, no activity could be re- 
corded at this time; that is, by this instrumenta- 
tion both the frequency and the amplitude were 
reduced to “zero” (5 microvolts produced ss in. 
deflection). Occasionally, however, some of these 
periods of "no activity" were disturbed by high 
amplitude (high frequency bursts that were coin- 
cidental with gasping behavior), the last reflex to 
be abolished. Whether any relationship existed 
between the "absence" of clectrical activity (rather 
than a mere diminution of frequency and ampli- 
tude) at this time and defective behavior later 
could not be discerned from the data collected in 
this study. 
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APPENDIX В 


HISTOLOGICAL TECHNIQUE AND RESULTS 


The preparation of the material for histological 
examination of permanent changes attributable to 
prenatal anoxia was as follows : 

The animals were first anesthetized (40 mg. 
pentobarbital/kg. body wt., administered intra- 
peritoneally) and then killed by perfusion with an 
isotonic saline solution followed by a 10% solution 
of formaldehyde. 

The brains were removed and stored for at least 
two weeks in the 10% formaldehyde solution be- 
fore preparation for sectioning was begun. Save 
for the cerebellum with the attached pons and 
medulla, the brain was cut transversely to the 
long axis, perpendicular to the surface of the 
cerebrum. For the purposes of this analysis, two 
consecutive sections (cut at 10 micra) out of each 
16 were mounted and stained with thionin, a Nissl 
stain. However, only the sections through the 
cerebrum in the region of the thalamus were 
examined. 

The rationale for this procedure was founded 
on certain facts of neuronal degeneration and on 
certain findings reported by other investigators. In 
all instances in this study, the deprivation of 
oxygen occurred at least four months prior to the 
removal of the brain. This meant that any change 
in the morphology of the individual cells would, in 
all likelihood, be corrected, or, if not, the cell 
would have died and complete degeneration taken 
place with the subsequent removal of the debris. 

ther investigators, notably Hurder (1952), have 
Teported that the significant neural difference be- 
tween animals exposed to anoxia and their controls 
Was in the number of neurones per unit of cortical 
tissue, Although some evidence indicates the pos- 
sibility of localized and diffuse lesions (Windle et 
al, 1944), it seemed, from what is known of the 
degenerative process and from the empirical data 
collected in an experiment similar to the present 
One, that the best procedure would be to que 

10 tissues for cell counts. This, however, wouk 
Not necessarily limit the possibilities for locating 
and identifying lesions and other gross anomalies. 


TECHNIQUE 


For Si i free of artifacts and 
ana four sections A 
Spit : ace of the brain, 


clearly icular to the surf. i 
Were лу each animal. These sections 
Were as widely distributed as was possible while 
Still complying with the above limitations. 
Before any quantitative analysis was made, = 
Sections were first e served for structural an 
Cellular abnormalit. й, tumors, variations in ap- 
Parent size of cells, nd evidence of recent trauma. 


The comparisons between the two pm "t 
8arding cell number proceeded along two lines. 


was first recognized that as a consequence of 
oxygen deficiency, some of the cortical cells could 
be lost. In the adult tissue, however, the loss could 
be signified by a reduction in the thickness of the 
cerebral cortex or by a reduction in the number of 
neurones per unit of cortex. As the rats were 
in the fetal stage at the time of deprivation, both 
indications of reduction in the absolute number of 
cells were likely, Consequently, attempts were 
made to determine both with this material. 

The thickness of the cortex was measured with 
an eyepiece micrometer (with a magnification of 
35X) at five pairs of corresponding points on 
either side of the sections. 

For the estimation of relative cell density, counts 
were made on 25 fields, randomly selected, from 
+ sections, At a magnification of 450X, the counts 
were made as follows: using an eyepiece equipped 
with one short and four long pointers, the observer 
counted the number of pyramidal cells (as indi- 
cated by the appearance of the nucleus on the 
particular section at the particular level of focus) 
in layers III and V covered by the tips of the 
iour long pointers. (The short pointer was used 
to focus the field.) It was possible, thereby, that 
within a single field as many as 4 or as few as 0 
could be so covered by the tips. For the 25 fields, 
100 cells could be so covered—or, possibly, none. 
The number of such "hits" was the datum for the 
individual $ for intersubject and intergroup com- 
parisons. 

The reliability of this procedure was checked 
with material prepared in the same fashion as this 
but taken from animals other than those used in 
this experiment. These results are given in Table 
27. А coefficient of concordance (Kendall's W 
[Siegel, 1956]) computed on these counts was 499 
which is significant at the .05 level. 


TABLE 27 


RELIABILITY CHECK ON TECHNIQUE FOR 
CELL CouNT 


Count 


Section and Slide | 1 | 2 | 3 | 4 | 5 |Mean 
1 18 | 17 | 14 | 19 | 16 | 16.8 
2 15 | 14 | 13 | 14 | 16 | 14.4 
3 18 | 16 | 19 | 19 | 17 | 17.8 


Note.—The material for this reliability check was taken from 
a single animal, а normal adult female rat. 
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RESULTS 


The qualitative examination of the tissues re- 
vealed no significant structural difference between 
the experimental and control animals. The size and 
shape of the cells were uniform in both groups of 
animals. The only pathology which could not be 
attributed to artifact was a glial module in one of 
the female control “dulls.” 

In estimating absolute cell number from the 
thickness of the cortex, the observers found greater 
within subject variability than between subject 
variability. The measurements differed more with 
the portion of the brain from which the section 
was taken than from one S to another. (In some 
instances, unfortunately, the sections from the con- 


TABLE 28 


COMPARATIVE DENSITIES OF LAYER 


s III AND V or CONTROL AND EXPERIMENTAL GROUPS 
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trol and experimental animals were not taken from 
corresponding portions of the brains—even though 
the original stipulation that the selection be 
through the region of the thalamus was fulfilled.) 
The range of thicknesses was from 50 to 110 units. 

Finally, as can be noted in Table 28, no expected 
differences were discerned on the basis of relative 
cell counts. As it turned out, the counts for the 
experimental animals were higher than the counts 
ior the comparable control groups. The means 
were: "bright" controls: 184; "bright" experi- 
mentals: 19.9; "dull" controls: 14.7; and "dull" 
experimentals: 17.6. Any statistically significant 
differences here would be contrary to the predicted 
findings. 


= сении 
Меап 
Group Animal| Sex Layer Count | Mean ПІ | Mean V Total 
Bright Control 49 M Ill 16.7 17.8 19.0 18.4 
V 14.5 
81 M ш 19.0 
у 23.5 - 
Bright Experimental 42 F ш 17.0 22.4 17.5 19.9 
У 17.0 
36 Е III 23.0 
V 16.2 
48 M ш 24.2 
V 12.5 
85 M III 25.2 
У 24.2 
Dull Control 21 F III 14.5 15.8 13.6 14.7 
V 13.7 
39 Е III 16.1 
У 12.6 
43 Е ш 17.0 
У 15.5 
20 м ш 1555 
У 12.8 
Dull Experimental 38 F Ш 22.2 18.0 17.1 17.6 
У 21.5 
62 Е III 19.7 
V 14.5 
40 M ш 19.0 
У 17.0 
76 M III if A 
У 17.2 
80 M Ill 17.3 с 
У 
88 м III 
V 
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НЕ major thesis of this study, repre- 
TES the outcome of several investi- 
gations of the perceptual and personal 
Characteristics of intense "liberalism" and 
"conservatism," is that certain important 
similarities between the two have not been 
adequately investigated. In this respect, the 
present research deviates from the usual ap- 
proach, which focuses upon apparent differ- 
ences. 

In a sense, however, this study has con- 
tinuity with previous investigations. His- 
torically, the direction of theory and re- 
search has been to demonstrate the gener- 
ality and inclusiveness of social attitudes. 
Early studies, for example, viewed preju- 


^^ dice more or less as a specific and inde- 


pendent attitude. More recently, the ten- 
dency has been to regard it as related to the 
entire personality, as part of a highly gen- 
eralized system, manifesting itself even in 


\ ‚ Situations where social issues are not pres- 


ent. The evaluation of social attitudes as 
, Conceived in the present study expands the 
Problem by viewing extreme attitudes as 
alternative phenotypic manifestations of the 


Same or similar genotype. 


! This study received the award by the American 
“\Ssociation for the Advancement of Science in 
7 for meritorious sociopsychological research, 
and is based upon a doctoral dissertation at New 
ork University, 1954. | j 
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SIMILARITIES IN THE STRUCTURE OF 
EXTREME SOCIAL ATTITUDES? 


IRVING A. TAYLOR? 
New York University 


The primary purpose of this research was 
to explore perceptual reactions of individ- 
uals holding opposing positions on a general 
social attitude continuum ranging from rela- 
tively extreme “conservatism” to extreme 
“liberalism,” the terms referring here to the 
traditional “left-right” distinction. Specifi- 
cally, it was our aim to compare the precep- 
tual reactions of individuals holding opposed 
social attitudes with regard to closure or 
completion tendencies manifested in their 
reproductions of relatively neutral material 
not directly related in any apparent manner 
to social issues, 

Perception was regarded as a fruitful ap- 
proach for obtaining relevant information 
since neutral material could be used. This 
allowed for a direct comparison of under- 
lying personality tendencies of individuals 
expressing opposing social attitudes without 
the interference of ego controls provoked 
by the verbal content of attitude scales. 

This study of similarities revealed through 
perception was prompted by the considera- 
tion that the intensity and degree of a social 
attitude is equally important or even more 
important with regard to social relations 
than the manifest content or direction of 
that attitude. Moreover, perceptual behavior 
seems to have more to do with the general- 
ized quality of the character structure than 
does the attitudinal content. It must be 
emphasized, however, that indications of 
underlying similarities do not reduce the 
reality and importance of overt social dif- 
ferences. Rather, it is the writer’s intention 
to throw additional light on the structure of 
extreme social attitudes. 

An operational description of the key 
terms used in the investigation may serve to 
clarify subsequent questions. The term per- 
ceptual closure is interpreted as signifying a 
general tendency to complete what is incom- 
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pletely presented in the field of vision, pre- 
sumably for purposes of effecting a more 
stable perception. This is measured in terms 
of the degree to which an individual reduces 
the size of openings when reproducing a 
figure containing several opened lines. The 
essential conditions involved are (a) a situ- 
ation which is clearly incomplete, and where 
ambiguity and discovery are not essential 
factors, and (b) an opportunity to resolve 
the open situation through overt activity. 

To avoid confusion with other current 
usages, extreme “liberalism” refers here to 
a relatively intense ideological predisposi- 
tion characterized by acceptance of minori- 
ties and by the rejection of authoritarian 
power groups and beliefs. The essential 
criteria are (a) a direct and positive indica- 
tion of acceptance of ethnic minority groups 
and (b) a direct and positive indication of 
rejection of authoritarianism. The impor- 
tance of obtaining direct and positive indi- 
cations of both acceptance and rejection is 
discussed in a later portion of this mono- 
graph. 

Extreme “conservatism” is conceived as 
а relatively intense ideological predisposi- 
tion involving acceptance of authoritarian 
power groups and beliefs and hostile atti- 
tudes toward ethnic minorities. The essen- 
tial conditions are (a) a direct and positive 
indication of acceptance of authoritarianism 
and (b) a direct and positive indication of 
ethnic minority rejection. The formulation, 
again, includes acceptance and rejection 
components. 

Attitudes which reflect neither strong 
prejudices with regard to any social group 
nor strong beliefs with regard to any social 
issue are regarded as reflecting moderate or 
intermediate attitudes. These attitudes, 
therefore, lack the intensity involved in ex- 
treme ideological predispositions. 

'The selection of perceptual closure as a 
variable for investigation is prompted 
largely by what may be described as struc- 
tural similarities suspected to underlie the 
behavioral pattern of extreme social groups. 
Ву a structural similarity we mean a char- 
acteristic such as the intensity of an atti- 
tude which can be expressed in apparently 
diverse attitudes. In many respects this 
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formulation is similar to Klein's (1951) 
conception of Anschauung or "perceptual 
attitude," as indicated in the following 
passage: 

The term Anschauung is preferable to “attitude” 
because of the rather more narrow, well-worn 
connotations of the latter term in American PSY” 
chologies, particularly in social psychology. Attitude 
usually implies a quite specific content and a direc” 
tion toward or away from an object, In this 
common meaning it carries no implications 0 
formal personality structure, But our use of it 15 
precisely in the latter sense as а genotypic principle 
of control, having no ties to specific content, not 
necessarily related to particular conflicts ог stresses 
and with counterparts in all forms of cognitive 
behavior (p, 332). 

Thus, the extreme conservative may E 
press hostility and prejudice toward mem 
bers of minority groups; the extreme liber?" 
similarly, may express the same degree ° 
hostility and prejudice toward members e 
antidemocratic groups. Stereotypy, fre 
quently associated with the fascist, 15 also 
apparent in the extreme "left"; “Negro; 
are inferior" and “Capitalists are exploiters 
are both highly stereotypical. As oppose" 
social extremists proceed farther apart ideo 
logically, the structural similarities appe? 
to become even more similar. Whereas bot 
the groups and the institutions toward which 
the attitude is directed may differ widely 
the intensity is comparable. Extreme h05“ 
tility, prejudice, and stereotypy appear to 
exaggerated forms of behavior, presumabl 
having the function of reducing tensio” 
initiated by open social situations. 
tendency toward extreme social attitudes ! 
reflected by the nature of the perceptual К. 
production technique utilized in the prese? 
study. 

To the extent that several studies nav? 
already supported the general hypothe?” 
that extremely conservative or ethnocent* 
individuals manifest closure behavior, ^, 
closely related behavior, in a situation Ф, 
parently unrelated to social issues, our e 
tention is not new. What is novel іп o 
research, however, is the hypothesis Ha 
this tendency is similarly discernible i? 7 » 
perceptual reactions of those of the “Je у 


«alld 
fica’ 


The following hypotheses are speci d 


: E e 
formulated to test the relationship betw 


te >» 
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perceptual closure and extreme social atti- 
tudes. 

1. There is no apparent difference in per- 
ceptual closure reaction between individuals 
predisposed toward extreme conservative 
social attitude and those predisposed toward 
extreme liberal social attitude. 

2. Individuals predisposed toward ex- 
treme liberal social attitude manifest greater 
perceptual closure reaction. than those ex- 
hibiting moderate social attitude. 

3. Individuals predisposed toward ex- 
treme conservative social attitude manifest 
greater perceptual closure reaction than 
those exhibiting moderate social attitude. 


RELEVANT LITERATURE 


The Authoritarian Personality 


One of the major contributions to the 
area of ideological predispositions has been 
The Authoritarian Personality (Adorno, 
Frenkel-Brunswik, Levinson, & Sanford, 
1950). Early in the investigation, a scale 
was produced having as its purpose the 
measurement of ethnocentrism (E scale), 
designed to measure prejudice directly. In 
subsequent analysis, however, another scale 
was evolved, the California Fascism Scale 
(F scale), having as its purpose the meas- 
urement of implicit antidemocratic trends, 
Or prefascistic potential, at the personality 
level, as distinct from manifest antidemo- 
Cratic tendencies. The F scale was intended 
to yield an estimate of minority group рге]- 
udice as measured more directly by the E 
Scale, The final revision of the F scale ш 
Опе used in the present study) correlate 


75 with the E scale. | | 
The major variables underlying the items 
Of the Е scale included conventionalism, 
authoritarian submission and aggression, 
anti-intraception, superstition and ea 
Туру, and projectivity and sex. These y р 
regarded as central factors related to sur аё 
Attitudes, These variables, according to the 


“ taken 

Authors (Adorno et al., 1950) “... 
Е tent of the 
‘Ogether, made up the basic conte more Or 


scale. Each was regarded as i 
SSS central trend in the person us 
“cordance with some dynamic process, 


hich, in 
ex- 
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pressed itself on the surface in ethnocentrism 
as well as in diverse psychologically related 
opinions and attitudes” (p. 228). 

The items of the scale were constructed 
according to a principle of maximum in- 
directness with regard to overt prejudices. 
In the final revised scale all items were 
formulated in terms favorable to antidemo- 
cratic practices and beliefs. It is important 
to note, therefore, that a low scorer—one 
rejecting scale items—is “democratic” or 
“liberal” by implication only. 

The procedures utilized by the authors of 
The Authoritarian Personality included in- 
terview, clinical, and questionnaire tech- 
niques. The primary basis for interpreting 
individual predispositions was derived essen- 
tially from interview protocols of extreme 
scorers, those tending either to agree or dis- 
agree strongly with the items of the scale. 
Linearity of social attitudes was, therefore, 
assumed, and the middle group was conse- 
quently excluded. The significance of this 
exclusion will be discussed shortly. 

A major finding of The Authoritarian 
Personality was the identification of a well- 
defined syndrome characterizing high scor- 
ers. Subsequent studies have attempted to 
substantiate these findings as well as demon- 
strate further differentiations between low 
and high scorers. In many instances, how- 
ever, the practice of excluding middles has 
been continued, thus perpetuating the as- 
sumption of “left-right” linearity and ob- 
scuring any curvilinear tendencies that may 
have been revealed in the data. In addition, 
most studies have either implicitly or ex- 
plicitly identified low scores with “liberal- 
ism" despite the inferential basis of this 
interpretation and without regard to the 
fact that the authors of The Authoritarian 
Personality found the low scorers to be rela- 
tively more diverse as a group. 


In an early study by Rokeach (1948), individuals 
above the median of the E scale were found to be 
more rigid with regard to solving Einstellung 
problems than those below. That the highly ethno- 
centric individual approached social problems more 
rigidly was apparent. The finding that this carried 
over to situations not directly related to social 
problems led Rokeach to postulate a factor of 
“generalized mental rigidity” in ethnocentrism. A 
further study by R. W. Brown, (1952), in which 
the F scale was used in conjunction with the 
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Einstellung problems, revealed that differences in 
problem solving rigidity between low and high 
scorers became manifest only under conditions of 
ego involvement. 

Rokeach has shown differences between low and 
high scorers in several other studies. In one study 
(Rokeach, 1950) the Levinson 10 item Ethno- 
centrism Scale was administered and subjects (Ss) 
were asked to describe the way in which they 
thought 10 ideological terms (Buddhism, Capital- 
ism, Fascism, etc.) were interrelated. The low 
quartile demonstrated the greatest amount of com- 
prehensive organization, the middle quartiles a more 
isolated organization, and the high quartile a 
narrow one, thus indicating a linear trend. In a 
further study (Rokeach, 1951), Ss were asked to 
define these ideological terms. Although the lowest 
quartile in the Е scale responded more frequently 
with relatively abstract definitions as expected, they 
were also found to favor the type of “reification” 
usually associated with the higher scorers. Rokeach 
(1952) has more recently produced a scale intended 
to measure the "dogmatic personality" reported to 
be independent of the traditional "left-right" con- 
tinuum. 

In a study by Frenkel-Brunswik (1951), ethnic 
Prejudice was linked with “intolerance of am- 
biguity.” Noting an inherent ambiguity intolerance 
in the social responses of prejudiced children, the 
hypothesis was tested that this intolerance reveals 
itself also in neutral material. А technique was 
utilized in which a series of pictures were pre- 
sented one at a time beginning with a clear percept 
of a dog which gradually transforms into a cat. 
The results indicated that children high in ethnic 
Prejudice perseverated longest with their initial 
response as the picture series progressed. Also 
included in her study were cognitive materials in 
which similar results were obtained, She concluded, 
“There is some indication of a prevalence of pre- 
mature reduction of ambiguous cognitive patterns 
to certainty in the prejudiced subjects, as revealed 
by a clinging to the familiar . . ” (p. 140). She 
further noted a close relationship between “intoler- 
ance of ambiguity” and Rokeach’s “generalized 
mental rigidity.” 

Highly similar to the two studies mentioned above 
is the one by Kutner (1958), in which children 
were administered problem solving tests. Children 
scoring low in ethnic prejudice gave evidence of an 
ability for abstract conceptualization while high 
scoring children presented a more concrete approach 
to problem solving with an accompanying higher 
rate of failure, 

The results of an experiment by Block and Block 
(1951) indicated that Ss relatively high in ethno- 
centrism were quicker to establish autokinetic 
norms than relatively low scorers. This rapid 
establishment of frames of reference was identified 
with excessive ego controls by the authors although 
the similarity to Frenkel-Brunswik’s “intolerance of 
ambiguity” was noted. The authors, in discussing 
their statistical approach, make the following some- 
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what dubious the middle 


group: 


comment concerning 


Although the latter analytical procedure [chi 
square] probably capitalizes somewhat on chance 
in the decision as to how many subjects are to be 
included among the ‘Highs’ and ‘Lows,’ it is sub- 
stantially justified as a means of removing the 
middle range of ethnocentrism scores. It is here 
that discrimination is most difficult to achieve ат 
such relationships as exist tend to be obscure 
(p. 309). 


The middle group, however, can be considered 
as obscuring a relationship only if it is assum 
to be linear, 

In an experiment involving changes in е 
memory of geometric forms, Fisher (1951) P 
found that high scorers in ethnocentrism shows 
а strong tendency toward symmetry and simpli 
cation, whereas low scorers were more able 2 
"break" levelling tendencies. Although it М 
hypothesized that the law of Prügnanz would a 
to both extremes, only the lows would witlist? 5 
or "break the Gestalt.” Although Fisher S 
at the outset that ethnocentrism is considered ? e 
continuous variable, the data only for extre. 
are presented with a cautious note that the re 
are limited to the extremes. It is not clear M 
the data for the middle group were not presen st 
since this would have allowed for a direct bo 
and demonstration of linearity, and would no 
eliminated the necessity for confining the re$" 
to extremes only, t 
Frenkel 


In a study by Barron, reported by ic 
” T 


jrunswik (1954), in which an aesthetic prefere {5 
test was used, И was found that ©“, . . subj 
preferring the [simple] symmetrical figures t 
to be less original, less intellectual, less. sens!’ 
less internalized, more constricted, more conserve. 
tive, more ethnocentric, more conforming, and 8 
independent" (р. 249). Many other studies using 
the California scales have reported differen, 
between extreme low and high scorers, Ш п 
Study by Evans (1952), prejudiced Ss more oft? 
favored political and economic values, while Ek 
prejudiced Ss tended more toward the acsth? 
and theoretical, revealed by the Allport-Ver™ 
Study of values, I. A. Taylor (1951) has demo 
strated a slight tendency for high scorers 4 
anticipate the correct sequence of events Ш 
picture series test. Meer (1952) has shown 05° 
Ss measuring high in authoritarianism report ?' £^ 
tility toward strangers in their dreams тоге 


n 


and nonvolunteer Ss, Rosen (1951) has foun is" 
the former were less subject to convention?" 
authoritarianism, and other personality charac ma 
istics presumably measured by the F scale. , g 4 
was interpreted as indicating that volunteer” 


= 
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not entirely a function of the situation, but is 
related to personality characteristics. 

Relying upon the evidence presented in The 
Authoritarian Personality that a fair estimate of 
ethnocentrism measured by the E scale could be 
Predicted from the F scale, we have reported the 
results of studies using either scale, Several 
studies, moreover, have independently reported high 
correlations between the two scales. Srole (1951), 
however, has raised a problem concerning this re- 
lationship. In a study in which a five item F 
scale and an ethnic minority social distance scale 
were administered in conjunction with a five item 
Anomie scale presumably measuring psychosocial 
factors related to social dysfunction or dis- 
organization, analysis of the data, suggested that 
the Г scale did not highly correlate with ethnic 
prejudice independently of anomic factors, and 
that prejudice was more closely related to anomie 
than to authoritarianism. Roberts, Rokeach, and 
McKitrick (1952), in a repeat study, however, 
found that prejudice and authoritarianism corre- 
lated .64 and prejudice and anomie .55, but when 
anomie and education were held constant, prejudice 
and authoritarianism correlated .48, whereas the 
correlation dropped to .28 between prejudice and 
anomie when authoritarianism and education were 
held constant. The writers concluded, therefore, 
that there is insufficient support for Srole's con- 
tention that anomie is more closely related to 
prejudice, and that prejudice and authoritarianism 
are not highly related independently, but that the 
presence of a low correlation between anomie and 
prejudice is worthy of further investigation. The 
interchangeability of the California scales in 
research, therefore, appears to be justified. 

With regard to the general validity of the Cali- 
fornia scales, our first concern, Eager and Smith 
(1952), using a modified form of the F scale, have 
shown that agreement existed between summer 
Campers’ perception of the degree of authoritarian 
behavior shown by counselors and the scores 
Obtained by the latter on the scale. Campbell and 
McCandless (1951) have reported results similar 
© those in The Authoritarian Personality. Using 
ап independently constructed scale | designed a 
Measure racial attitudes, their findings may С 
interpreted as evidence for the general validity o 
the California results. Е i 

Christie and Garcia (1951) empirically ps 
the hypothetical variables underlying the items b 
Ше F scale (conventionalism, authoritarian su 
mission, authoritarian aggression, аш пер 
Superstition and stereotypy, and projectivity ап 
s mn from thc 
Sex). The empirical clusters resulting =. 
analysis of interitem correlations appeared ШЕЙН h 
fairly well with the hypothetical ones, alt et 
some items did not fall into any of the define 


Clusters, 
Although there app 


Sround for claiming genera 
California scales and the un 


ears to be sufficient 
1 validity for the 
derlying vari- 


ables, it must be emphasized that actual be- 
havior itself is not necessarily the validating 


criterion of social attitudes. Thurstone 
(1954) has emphasized this important 
point: 


In studying the measurement of social attitudes, 
the attempt is sometimes made to validate such 
experiments in terms of overt behavior, but that is 
an еггог.... А man may be entirely consistent 
in what he says and in what he does about a con- 
troversial issue, and yet both of these indices may 
be dead wrong in reflecting his attitude . . . . 
Attitudes are essentially subjective experiences 
which may or may not conform with overt action 
(pp. 51-52). 

The suggested criterion for validation, ac- 
cording to Thurstone, can be ascertained 
only through intimate face to face private 
conversation. It seems, therefore, that the 
above mentioned studies constitute more a 
measure of consistency between overt be- 
havior and attitude scale scores than an 
index of the validity of the scales. The 
scope of the present paper does not permit a 
further discussion of this important point. 

Our second major concern involves the 
meaning of low and high scores on the Cali- 
fornia scales. It was previously mentioned 
that the items of the scales were favorable 
to authoritarianism, as reflected, for ex- 
ample, in the F scale item, “Obedience and 
respect for authority are the most impor- 
tant virtues children should learn.” 

One problem presented by a scale con- 
taining items of this nature involves the 
general tendency for some Ss to give posi- 
tive responses independent of content. Cohn 
(1953), for example, reports a significant 
.25 correlation between the F scale and 33 
items selected from the Minnesota Multi- 
phasic Personality Inventory. Many of the 
items used by Cohn, however, seem to be 
in agreement with the underlying F scale 
variables. The MMPI items “At times I 
feel like smashing things” and “At times I 
have a strong urge to do something harmful 
or shocking” seem to be possible additional 
“authoritarian aggression” F scale items; 
and “When I take a new job, I like to be 
tipped off on who should be gotten next to” 
is suggestive of the “power and ‘toughness’ ” 
variable. It is, therefore, surprising that the 
correlation between the F scale and the 
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MMPI items was not higher. The correla- 
tion may have been weakened by the inclu- 
sion of certain MMPI items which, although 
authoritarian in flavor, were more of the 
"extreme liberal" variety, as, for example, 
the item, "Some people are so bossy that I 
feel like doing the opposite of what they 
request, even though I know they are right." 
The effects of the general tendency to re- 
spond positively, therefore, is open for fur- 
ther investigation. 

Another problem relevant to the present 
study regarding the construction of the 
California scale items involves the dubious 
assumption that low scorers rejecting anti- 
democratic beliefs are thereby necessarily 
"democratic" or "liberal." Whereas, a high 
Score may reasonably reflect an inclination 
toward fascistic ideology (since high scores 
indicate agreement with the explicit content 
of the items), low scores merely reflect the 
absence of such agreement. The beliefs and 
attitudes imputed to the low scorers are, 
therefore, inferred. 

It follows that when using the California 
scales high scorers as a group would be 
more homogeneous than low scorers, since 
the former have common grounds for agree- 
ment while the basis for low scorers' dis- 
agreement remains varied and unknown. It 
is not the agreement per se which produces 
greater homogeneity among high scorers, 
but rather the unmistakable conservative 
and authoritarian quality of the items which 
suggest the identity of agreers. We cannot 
infer, unfortunately, the reasons which 
prompt disagreement. There is little doubt 
that extreme liberality is one reason, and 
probably constitutes a large percentage of 
the low population. Reasons, however, 
other than extreme liberality may be in- 
fluential. These may include: strong opposi- 
tion to any opposing ideology while main- 
taining moderation in one's own beliefs; 
general personality traits such as the tend- 
ency to disagree; resentment against un- 
qualified and ambiguous items; or, merely, 
an expression of disagreement with the test 
situation itself. 


There are additional reasons for suspect- 
ing that a low score on the California scales 


TAYLOR 


does not necessarily reflect liberal predis- 
positions. There is the finding in The Au- 
thoritarian Personality that low scorers as a 
group were more diverse than high scorers, 
and were categorized in several subtypes, 
one of which was termed the "rigid low." 
Other studies, moreover, have found greater 
similarities among high scorers than among 
low scorers. Rokeach (1950), for example, 
found more consistency among highs than 
among lows, and Meer (1952) did not find 
significant differences among the lows, al- 
though the pattern was clear for the highs. 

To test the hypothesis that low scores on 
the F scale do not necessarily reflect extreme 
liberality, the investigator administered a 
12 item F scale in conjunction with a modi- 
fied version of the Bogardus Social Dis- 
tance Scale, involving ethnic minorities, to 
21 secondary school Ss. In confirmation, 
it was found that whereas approximately 
67% of the Ss were in the high third of 
both scales, and 50% in the middle third of 
both scales, only 29% of the lowest scorers 
on the F scale were the lowest scorers on 
the social distance scale. Low scorers on 
the F scale, moreover, did not show more 
indication of ethnic minority acceptance 
than the middle F scale scorers. In fact, 
there was some indication of curvilinearity 
to the extent that the mean on the social 
distance scale for low Г scale scorers was 
slightly greater than the mean on the social 
distance scale for middle F scale scorers. 
The high scorers on the F scale, on the 
other hand, showed more consistency in 
agreement with the social distance scale. 

Since the above study was limited to too 
few Ss, we cannot safely generalize, The 
implications, however, were important to 
the present research in which comparisons 
were made between groups opposed in 
social attitudes. It was largely for this rea- 
son that we have defined the extreme liberal, 
extreme conservative, and the intermediate 
groups on the basis of two independent cri- 
teria—those demonstrating consistency in 
score on both the F scale and social distance 
scale. This precaution increases the proba- 
bility of correctly identifying the position 
of Ss along a general social attitude con- 
tinuum. 


————————— 
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A third major concern, and the most im- 
portant, is with regard to the assumption of 
linearity of social attitudes. Linearity was 
largely taken for granted by the investiga- 
tors of The Authoritarian Personality, as 
reflected in their decision to limit interviews 
to extreme scorers. The exclusion of inter- 
mediate scorers was unfortunate, for it pre- 
vented a comparison between the middles 
and extremes. The entire research, as in the 
case of many subsequent investigations, was 
geared apparently toward differentiating 
lows and highs, thereby obscuring any curvi- 
linearity that may have been present. 

Hyman and Sheatsley (1954) have raised 
this same methodological criticism, object- 
ing further that “The findings obtained 
from interviews with opposing extreme 
groups are generalized to the non-extreme 
middle groups” (p. 69), and that “Research 
studies have frequently revealed a curvi- 
linear relation between social attitudes and 
various determinants of those attitudes, and 
a number of investigations have indicated 
that extreme individuals differ from others 
in the degree of organization of their senti- 
ments and in the relation of such sentiments 
to personality factors” (p. 66). For some- 
what different reasons, Luchins (1950) has 
raised a number of similar criticisms, and 
suggests the importance of having inter- 
viewed a sample of Ss in the middle cate- 
gory as a necessary control for the linear 
assumption. 

Actually, the important issue involved 
here is the implication which follows di- 
rectly from the assumption of linearity. 
This is the contention that there are basic 
differences between the personalities of ex- 
treme scorers. As a consequence, the mid- 
dle group becomes an unimportant link be- 
tween extremes which can be justifiably 
eliminated. Although there are a number 
of alternatives to the assumption of line- 
the one that concerns us here is 
y, which stresses basic similari- 
ality of extremes. The 
becomes the decisive fac- 
nted in relation to 


arity, 
curvilinearit 
ties in the person 
middle group then 
tor and must be prese 
extremes. 

_ Linearity and th 
Чоп of extremes is exp 


e contention of opposi- 
licit in the following 


discussion by J. F. Brown (1936) concern- 
ing the similarities and differences between 
the fascist and communist dictatorships : 

The basic “purpose” of the fascist dictatorship 
is to assure freedom of economic locomotion to the 
bourgeoisie. The basic “purpose” of the communist 
dictatorship is to assure freedom of economic loco- 
motion to the proletariat. . . . this variance in the 
class basis of the dictatorships makes them as 
different from each other as black from white. 
The similarities from the standpoint of a field- 
theoretical analysis are of a very superficial nature, 
the differences are basic [Italics added] (p. 410). 

In an almost identical context, Shils 
(1954), some two decades later, discusses 
what appears to be a similar issue, but with 
the alternative implication of basic similari- 
ties between extremes and the superficiality 
of differences: 

Obviously a Fascist who says that the Jews have 
monopolized almost all the important posts in the 
Government is concretely different from the 
Bolshevik who asserts that the small circle of big 
business men control not only the economic life 
but the intellectual, political, and religious life of 
the country. Concretely these two views are very 
different—one we know to be the usual paranoid 
anti-Semitism—the other sounds like a somewhat 
crude social science proposition in which many 
intellectuals in the West believe. Yet looked at 
from another point of view, they are strikingly 
similar [Italics added]. Both aver that a small 
group has with doubtful legitimacy concentrated 
the power of the country in their hands (p. 32). 

Thus, with striking contrast, the same 
phenotypic situation is considered genotypi- | 
cally different by Brown, but similar by 


Shils. 

In another place, Brown asserts that there 
are “. . . basic differences [between the 
fascist and communist state] in economic 
freedom for the different classes and the 
different positions taken with regard to 
nationalism, racial minorities, education, the 
arts, position of women, internationalism, 
etc" (p. 411). Since the publication of 
Brown's book, however, many of these dif- 
ferences are no longer apparent. The de- 
velopment of communistic ideology has in- 
creasingly come to resemble fascism in 
recent years, revealing similarities previ- 
ously obscured. Hoffer (1951), thus, points 
out an important similarity in an area be- 
lieved by Brown to involve basic differ- 
ences: “The Bolshevik and Nazi revolu- 


8 IRVING A. TAYLOR 


tions are also full-blown nationalist move- 
ments. The Nazi revolution had been so 
irom the beginning, while the nationalism 
of the Bolsheviks was a late development" 
(p. 18). 

When comparisons are made between ex- 
tremes in which the California scales are 
not employed, and when an examination is 
also made of the middle group, the results 
frequently indicate similarities rather than 
differences. Vetter (1930), in studying 
groups along a reactionary-radical con- 
tinuum, has reported a curvilinear relation- 
Ship between extremes with regard to intro- 
version. Reactionary men scored 16.1 and 
radicals 18.5, but intermediate conservatives 
scored only 14.8 on the Laird Personal In- 
ventory. He also found that while radicals 
had an average income of $7,100 and reac- 
tionaries of $7,700, the intermediate group 
had an income of $10,500. One of the re- 
sults of the present research clearly supports 
Vetter's findings. Allport (1937) has sug- 
gested that “a liberal is not merely a mild 
radical; in many respects he has a wholly 
different outlook. Nor is a reactionary 
merely an ultra-conservative, for he resem- 
bles the radical in many respects: he too 
favors change (albeit towards the old rather 
than the new) ; he too is discontented, and 
‚ ++ Seems to be as excitable and malad- 
justed as the chronic radical" (р. 431). 
Dombrose and Levinson (1950), indeed, 
have recently confirmed Allport's hypothesis 
that “а liberal is not merely a mild radical," 
by showing that the low quartile scorers on 
the E scale prefer militant measures in dis- 
tinction to the middle low quartile scorers’ 
preference for pacifist programs. The writ- 
ers conclude that *, . . the greater the in- 
tensity of support for democratic ideology, 
the greater the tendency to prefer militant 
forms of social action” (p. 107), and that 
i the militant low would attempt to 
change whatever social patterns . . . he re- 


gards as responsible for the growth ... of 
antidemocratic groups and ideologies" (p. 


108). The findings of Dombrose and Levin- 
son support the contention that the extreme 
low scorer on the California scales favors 
a form of behavior frequently associated 
with the high scorers. 


© 


- 


A study by Haimowitz and llaimowitz (1950) 
supports the above contention. Twenty-four Ss 
invited to engage in a therapeutic training session 
were administered the Bogardus Social Distance 
Scale involving ethnic minority groups. Several 
weeks later, the scale was readministered. Also in- 
cluded in the second social distance scale was 4 list 
of groups suspected to be at the focus of the ех- 
treme liberal’s hostility. The results indicated that 
Ss most friendly to ethnic minorities were most E 
tile to antiminority groups, a finding which seems t 
be in opposition to one of the results of the prest? 
study. Of the 24 Ss used, only the middle grou? 
showed any change during the training period, and, 
with regard to extreme individuals, the authors 
state that modification Of “, such [social 
attitudes is more difficult in these people than in 
others who hold to attitudes of cither friendlin 
or hostility with less inter Thus, the varia S 
which makes for modifiability of attitudes. is not e 
much the factor of directionality of айпа 
it is of intensity’ [italics added] (р. ^ 
The writers continue that ". . , persons who и 
not exclude etlinic groups may be 'anti-democ7 
in their generalized response to the world" (p. = i 
On the basis of several other pertinent findings, 
authors conclude, “ . . . the two extreme gt ге 
were more similar to each other than they b n- 
to the intermediate group” (p. 240). This nsed 
clusion is clearly in agreement with the one pa 
on our results. di 

An affinity between the extreme groups has be 


v d s ET 
strongly asserted by the historian Schlesing? 
gly j 1 Е 
(1949), who states, " from one viewpor 
the similarities [between the communists ifi- 


fascists] are vastly more overpowering and sig” he 
cant than the differences . . . the passage from jas 
extreme left to the extreme right and back 51) 
been fast and easy” (рр. 59-60). Hoffer (19 Е 
in a discussion of the interchangeability of as 
movements, similarly states that "When people © 
ripe for a mass movement, they are usually 
for any effective movement, and not solely for 
with a particular doctrine or program. Ш ther 
Hitlerian Germany it was often a tossup whe the 
a restless youth would join the Communists or 
Nazis" (p. 16). ie 
Shils (1954), in close agreement with the ° ит 
states that “ . . . there is almost always а 5!8 ie" 
cant number of low scorers whose deeper the 
positions are closer to the high scorer than б, ей 
low scorer. ln conformity with the preconc’ the 
idea that authoritarianism is a characteristic x 
Right and the corresponding notion that RS no 
no authoritarianism on the ‘left,’ there 32). 
analysis of these deviant low scorers” (P. “rou? 
Krugman (1952) has indeed investigated а eat 
that may be considered as “deviant low sco" ter^ 
Ex-members of the Communist Party were x of 
viewed, and the author reports many instance iori 
behavior bearing a striking similarity to aur 
tarian practices associated with the "right. 


one 
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There appears, however, to be an incon- 
sistency inherent in the notion of "deviant 
low scorers,” as exemplified in the above 
passage by Shils in which he states that 
some low scorers are "closer to the High 
Scorer than to the Low Scorer.” There 
seems to be the implication that there is a 
subvariety of renegade low scorer similar 
to the high scorer, and that the other low 
scorers are the genuine liberals having per- 
sonality characteristics basically different 
from the high scorer. This implication is 
also apparent in the following statement by 
Frenkel-Brunswik (1954) : 

Such concepts as that of the ‘rigid low,’ intro- 
duced in our book . . . amply testify to the fact 
that we have not lost sight of the possibility that 
there may be curvilinear relationships among the 
vario titudes or traits of personality and that 
certain sub-varieties of the non-ethnocentric per- 
sonality likewise exhibit the closeness of opposites 
that is bound to accompany intolerance or cognitive 
ambiguity or emotional ambivalence (p. 258). 


This inconsistency can perhaps be re- 
solved by considering only the "rigid lows" 
as the extreme liberals, and the remaining 
low scorers as individuals strongly disagree- 
ing with the items of the California scales 
for reasons other than extreme liberality. 
This interpretation is congruent with our 
previous discussion of the meaning of alow 
score. 

Frenkel-Brunswik (1954) has also stated. that 
"In our view, the susceptibility to ethnic prejudice 
itself is but another case of ‘alternative’ behavior" 
(р. 261), and, at another time, t “The relatively 
incidental ‘choice’ between different alternative 
manifestations of the same underlying tendency 
[italics added] is most crucial for the development 
of personality. Thus, whether a person is ethno- 
centric or liberal is closely related to whether his 
underlying aggression is accepted or not . . ." 
(pp. 265-266). Allport and Kramer (1946) imply 
а similar thought in suggesting that " . . . the 
feeling of being a victim of prejudice on the part 
of the [Jewish] subjects may lead to sympathy ог 
antipathy toward the Negro” (p. 29). 


There appears to be a conflicting tend- 
ency discernible in the literature to con- 
sider, on the one hand, only the “rigid low” 
lar to the extreme conserva- 
her hand, to regard all 
tially similar. The 
limit the category 
‘rigid low” scorer 


as basically simi 
tive, and, on the ot 
extreme liberals as essen 
proposed suggestion to 
“extreme liberal" to the ' 


only has the benefit of resolving this dis- 
crepancy, and also has the advantage of 
allowing for an understanding of the mani- 
fest differences as instances of one geno- 
type. 

In an exploratory study by B. Taylor 
(1952), which used an instrument for meas- 
uring closure similar to the one used in the 
present research, a significant positive re- 
lationship between closure and the F scale 
was found. The median division was used 
to differentiate low and high scorers. When 
Ss were divided into quarters, however, and 
a mean perceptual closure score was com- 
puted for each group, a tendency toward 
curvilinearity became apparent. 

In a second exploratory study in which 
a social distance scale involving ethnic 
minority groups was administered to sec- 
ondary school children in conj unction with 
the F scale and a perceptual closure scale, 
Ss were selected scoring in the low, middle, 
and high third on both social attitude scales. 
Those falling in the extreme categories ex- 
hibited the greatest closure tendencies, as 
predicted. It appeared, therefore, that per- 
ceptual closure tendencies occur for both 
extremes when independent measures are 
used conjointly to estimate the intensity of 
social attitudes. The sample, however, was 
too small for confident generalization. 


Perceptual Closure 


Reviewing the relevant literature on per- 
ception, particularly on perceptual closure, 
we find that the term has been used in 
widely different contexts. Ever since its 
introduction by the Gestalt psychologists its 
usage has been extended beyond the original 
limitation to perception. The following 
statement by Hochberg and Gleitman 
(1950), in discussing a nonseparatist theory 
of motivation, exemplifies this pervasive ех- 
tension of the term: 

Such a theory of motivation would consider all 
motivational phenomena as manifestations of 


equilibratory processes occurring in the psycho- 
physical field. From this point of view, tue 
formally simplest motivational phenomenon might 
be—not the rat running towards the food, not 
the infant establishing object-relations with its 
mother—but the much more prosaic case of an 


incomplete circle tending to close (p. 188). 
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As a basic concept of Gestalt theory, the 
term generally refers to an organizational 
process in which objects are perceived as 
complete or as "good" a figure as possible 
under the conditions obtaining. The term, 
however, as pointed out by Bobbitt (1942), 
has been expanded to include “ . . . any psy- 
chological phenomena in which a condition 
of incompleteness, either in the stimulus 
field or in some phase of the organism's 
activities, creates a tendency to overcome 
this incompleteness by perceptual reorgani- 
zation, by a combination of two or more 
different experiences, or by some overt ac- 
tivity of the organism" (p. 274). The gen- 
erality of the term is similarly indicated by 
Drever's (1953) definition of closure as a 
principle ". . . describing the process by 
which percepts, memories, actions, etc., at- 
tain stability, viz. the subjective closing of 
gaps, or completion of incomplete forms, so 
as to constitute wholes" (p. 40). 

Although the early Gestalt formulation 
of perceptual closure regards the tendency 
to close gaps as revealing а fundamental 
structural principle of brain dynamics, other 
investigators have tended to emphasize the 
role of past experience in determining the 
perception of a figure. Bruner (1951), for 
example, states that “. . . the process of 
closure . . . is an abstraction based upon а 
sampling of various situations in which dif- 
ferent expectancies have made different 
trace systems available for communication 
with input stimulus information" (p. 310). 

The general characteristics of completion ten- 
dencies have been carefully investigated and 
described in an early study by Gibson (1929). 
Using relatively meaningless geometric forms con- 
taining gaps or breaks to be reproduced by Ss 
Gibson showed that completing tendencies occurred 
for the group as a whole, and he concluded that 
"Completion occurs when the stimulus figure is 
apprehended as a single form, despite the gaps, 
and when the reproduction is drawn with a con- 
tinuous contour" (p. 27). Gibson is also inclined 
to relate his results to previous experience rather 
than to laws of configurations. Gibson also pointed 
out the important feature of differences between Ss 
with regard to degree of closure. Only 1596 of the 
group closed gaps completely. Similar results were 
obtained in earlier studies by Hemstead (1901) 
and Meakin (1903). 

'Thurstone (1944), in an extensive analysis of the 
factors involved in perception, has isolated an im- 


portant one which he has called "closure," 
described as being concerned with the facility, 
rather than acuity, in perceptual closure. An 
examination of the tests found to be loaded with 
this factor “ . seem to represent the ability 
to form a perceptual closure against some dis- 
traction. They also represent the ability to hold a 
closure against distraction" (р. 101). Thurstone 
further points out three specific usages of ш 
term: (а) when what appears to be unrelate 
and unorganized suddenly appears as a means 
fully related whole; (b) when a part of Me 
presented items are suddenly perceived as a unity 
and the remaining parts are perceptually dis 
carded; and (c) when a good configuration 1 

to be destroyed in order to discover a differen 
configuration (p. 20). It would seem that Li 
usages are more closely related to other concep’ 
than to closure: the first, to "insight"; the secon A 
to “field and ground”; and the third, to “flexibility: 


Moreover, these usages deviate far from ied 
original concept of perceptual closure, Бш еу 
all three usages involve a principle оѓ disc ent 


and ambiguity which is not an essential comp? 
of the process. E 
Although the above usages of the he 
differ largely from the one used here, 4) 
following statement by Thurstone (194 i 
clearly expresses an important propositi? 
underlying this study : я 
The fundamental hypothesis involved here is ШШ 
the dynamics of perception, and of other restri? ^f 
functions, are not isolated and that these seg 
functions are so related that some characteris fe 
the person as a whole might be inferred fro™ 5 
dynamics of one of these functions, In E 
days when we insist so frequently on the integ 
dependence of all aspects of personality, it wou! 
be difficult to maintain that any of these function 
such as perception, is isolated from the rest 
the dynamical system that constitutes the Рег 
(p. 3). 
z 1 ег” 
Roseman (1952), in fact, has related ГУ 
sonality organization to perceptual clost 
measured by a technique following АЕ at 
stone's first usage of the term (when W^ 
appear to be unrelated items suddenly ap" 
pear as a meaningfully related whole)- P 
chiatric patients and normal Ss were £l" to 
groups of parts that could be formed б 
single geometric figures. Of the 29 йоз ate 
items the normal Ss succeeded in co™P E 
ing slightly fewer than 22, while the PSY e 
atric patients completed slightly fewer an? 
18. The difference between the two ™ 
was significant. 


Relevant also is the study by p 
Brunswik (1951), where children were 
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sented with a series of cards showing a 
gradual transformation from a dog to a cat, 
a technique closely exemplifying the third 
usage of closure suggested by Thurstone 
(where a good configuration has to be de- 
stroyed in order to discover a different 
one). The present study shares in common 
with the above study the belief that there is 
a relationship between the closure of open 
figures and the “closure” of open social 
issues involved in extreme social attitudes. 
The hypothesis here, however, is that the 
extremes—both the “left” and the “right”— 
would demonstrate closure tendencies. 


DEVELOPMENT OF THE EXPERIMENTAL 
MATERIAL, DESIGN, AND THE 
SELECTION OF SUBJECTS 


An estimate of antidemocratic or authori- 
tarian tendency was obtained with the use 
of the 30 item revised California Fascism 
Scale (F scale). The scale has a reported 
reliability coefficient of .90 and correlates 
approximately .75 with the California Eth- 
nocentrism Scale (E scale). Likert-type 
scaling is employed, and a score reflects the 
degree to which an individual accepts or 
rejects implicit antidemocratic beliefs. The 
F scale is asymmetrical; all of the items are 
stated in antidemocratic form (Adorno et 
al., 1950). A copy of the F scale is included 
in Appendix A. 

Additional measures of social attitudes were 
Obtained through the use of a social distance scale 
containing two subscales, a minority subscale with 
10 ethnic minority groups and an authority sub- 
scale with 10 authority groups (see Appendix A). 
Measures of overt hostility or friendliness toward 
the two groups were obtained simultaneously. 

The social distance scale is a questionnaire in 
which the individual indicates the degree to which 
he will accept members of various groups, “Social 
distance” has been defined by Bogardus (1933) as 
“the degree of sympathetic understanding that 
exists between two persons or between a person 
and a group” (p. 265). The score is the number 
of checked inclusions of groups for the following 
situations : . р 

1. To close kinship by marriage 

2. To my club as personal chums 

3. To my street as neighbors 

4. To employment in my occupa 

5. То citizenship in my country 

6. As visitors to my country " 

7. Would exclude from my country 


tion in my country 


The score reflects policy orientations rather than 
feelings towards groups. The final category, 
*Would exclude from my country," was used as 
a check against the preceding acceptance cate- 
gories, i.e, a S checked either the categories 1 to 6, 
or only category 7. 

The ethnic minority groups included Negroes, 
Turks, Mexicans, Puerto Ricans, and Hindus. 
Representative authoritarian or antiminority groups 
suspected to be at the focus of the extreme 
liberals’ hostility were Fascists, Nazis, Reaction- 
aries, Ku Klux Klan, and Totalitarians, The 
minority and authority groups were presented in 
random order. Although the social distance scale 
has several inherent weaknesses, it was considered 
feasible as a supplementary indicator of social 
attitudes. 


Two scales were developed for measuring 
perceptual closure (see Appendix В). Pre- 
vious results with the use of several similar 
instruments have indicated that meaningful 
figures afford more reliable scores than rela- 
tively less meaningful geometric forms. 
After preliminary pilot studies, the outlines 
of a house and a boy were selected. Each 
figure was constructed of several lines con- 
taining small gaps or breaks. The figures 
were presented to the Ss with instructions 
to reproduce them "just as they are seen." 

Below the figures, all terminal points were 
provided for establishing the general out- 
line upon which the figure was to be repro- 
duced. Holding the size of the lines of the 
reproduced figure equal to those of the fig- 
ure provided a standard basis for measuring 
the size of the reproduced gaps, thus avoid- 
ing the necessity for using ratios. Percep- 
tual closure tendency is reflected by the 
mean size of the gaps in the reproductions. 
The size of all the gaps in the standard was 
26 mm. The lower the score, therefore, the 
greater the closure tendency. 

In order to test the hypothesis that the 
scores obtained on the “house” and "boy" 
scales reflected perceptual closure, or the 
tendency to complete openings in a single 
configuration, rather than mere completion 
tendencies, chance habits of writing, etc., 
the design scale was constructed as a con- 
trol, in which completion tendencies between 
several figures, rather than one, could be 
measured (see Appendix B). The size of 
the gaps in the design scale was also 26 mm. 

On the second sheet of the Е scale, infor- 
mation was requested about the age, Sex: 
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religion, major, education, and estimated 
yearly income of each 5. With the excep- 
tion of age and academic major, the infor- 
mation was obtained in the form of check 
lists. Ss were allowed to remain anonymous. 


The entire population studied was taken from 
classes at the University of Houston during the 
summer of 1953, and numbered 253. The Ss were 
selected from classes in psychology, education, and 
English. Table 1 summarizes the population 
characteristics of the group. Age ranged from 17 
to 61, with a mean of 2746 and a standard 
deviation of 8.81. The median age of 24.43 is a 
more appropriate estimate since age was not 
normally distributed. Ss were heavily concentrated 
at the lower age intervals. Since the group was 
comprised of many teachers and other professional 
people attending summer sessions, the average age 
was somewhat larger than usual. The ratio of 
men to women was approximately 3:2. Nearly 
80% of the group were Protestant, 170 were 
Catholic, and 3% were Jewish. 


Since not all students were expected to be gain- 
fully employed, an estimate of the parents' yearly 


income was requested for determining the Ss 
approximate economic level. The mean yearly 
income reported was $5,600, with a standard 
deviation of $1,920. Approximately 7% did not 
provide this information. 

Table 1 also indicates that the number of 
students increases with the level of educa- 
tion, with graduate students comprising 
more than one quarter of the population. 
АП levels of education, however, are well 
represented. The average level is somewhat 
above Junior. 

The population was heterogeneous with 
regard to academic major. Practically al 
areas oí study were well represented with 
business majors comprising the largest 
group (21%) and natural science majors 
the smallest (666). 


‚ АП 

Twelve groups were tested separately: q 

attempt was made to duplicate the pss 
dures as fully as possible in each insta? 


TABLE 1 
POPULATION CHARACTERISTICS OF THE GROUPS STUDIED* 
(253 Ss) 

Population Population & 
Characteristic N [4 Characteristic N 0 
Азе Education 5 
55-64 2 .8 Graduate 67 ж 
45-54 14 5.5 Ѕепіог 58 2r 2 
35-44 31 [2.3 Junior 46 ud 
25-34 77 30.4 Sophomore 44 1j `2 
15—24 127 50.2 Freshman $1 д» 
Unknown 2 .8 Unknown 7 ds 

Parental income Major 

$9000-10000 44 17.4 Busines 53 21-0 
7000-8000 30 11.9 Education 52 20.6 
5000-6000 72 28.5 Liberal Arts 40 15e 
3000-4000 74 29.2 Social Arts 34 Lay 
1000-2000 16 6.3 Biological " 

Unknown 17 6.7 Science 20 T i 

"Technology 19 o 
Na » 

Religion : pod 15 5.0 
Protestant 195 mai Unknown 20 ps 
Catholic 43 17.0 
Jewish 7 2.8 Sex 
Unknown 8 6.1 Меп 146 57.1 

Women 103 10-7 
Unknown 4 1. 


в Combined from 12 psychology, education, and English courses at the University of Houston. 
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The initial instructions were as follows: 

This morning (afternoon, evening) I would like 
you to take a few public opinion questionnaires. In 
answering the questions it is important that they 
reflect your own feelings. If you are not certain 
about some of these issues, your first impression 
is probably the best. There are, of course, no right 
or wrong answers. Please do not put your name 
on any of the papers, since we are only interested 
in the opinions of the group as a whole. Use only 
the mechanical pencils that will be distributed. 

It is not known whether this anonymity 
helps increase validity, but it does aid in 
dispelling tension and suspicion. Standard 
mechanical pencils were used to increase the 
comparability of reproductions on the per- 
ceptual closure scales by eliminating certain 
unnecessary differences due to the writing 
implement. 

The F scale was distributed with an indi- 
cation that no one was to begin until instruc- 
tions were read aloud. During the adminis- 
tration of the questionnaire, answers to 
most queries concerning definitions and 
meanings were to the effect that responses 
should be in accordance with one's own 
interpretations. 

Essentially, the same procedure was used 
in the administration of the social distance 
scale. The Ss were requested to hold all 
papers until the completion of the entire 
battery. 

The perceptual closure scales and the 
design scale were distributed together with 
the following instructions : 

This part of the study is not a public opinion 
questionnaire; it is another kind of test. In each 
gi the three sheets, you are to copy the rest of the 
паша in the space below using the dots as guides. 
осе that parts of the figure are already copied 

Ow. As soon as you complete the first, go on to 


the next, Do them i = " 
m 9 in the order given (“House, 
Boy," "Design"). er given ( 


Work accurately and quickly. Draw free hand 
lines as straight as you can with the mechanical 
pencil. It is very important that you copy the 
rest of the figures by eye only, 

The instructions were intended to pro- 
a subjective, rather than an objective, 
there were few questions 
purpose of these scales, 
bably due to the interest 
ovelty. The scales were 
single one-hour class 


duce 
set. T ortunately, 


concerning the 
which was pro 
aroused by the n 
completed within à 
Session. 


The item mean score for the 253 Ss on 
the F scale was 3.63, with a standard devia- 
tion of .82, which is below the F scale’s 
average of 4.00. 

One important requirement of a social 
distance scale is that acceptance increase 
with progressive levels of the scale. If, for 
example, a S indicated that he would accept 
Puerto Ricans “to close kinship by mar- 
riage,” then Puerto Ricans should be ac- 
cepted on all preceding levels (except for 
the last level, “Would exclude from my 
country”). The degree to which progres- 
sive increase in acceptance is actually ob- 
tained is indicated by the frequency of pat- 
tern reversals, e.g., accepting members of an 
ethnic group to one’s club as personal 
chums, but not to one's street as neighbors. 
There were 277 item reversals out of a total 
of 2,450 possible responses (since 245 Ss 
responded to 10 items) or 11.3% pattern 
reversals. This percentage of pattern re- 
versal was somewhat higher than expected 
and probably decreased the reliability of 
social distance scores to some extent. 

Table 2 shows the median and rank posi- 
tion for each of the groups contained in the 
social distance subscales. The five most ac- 
cepted were the minority groups, and the 
five least accepted were the authority 
groups. The division of groups into minori- 
ties and authorities, therefore, was psycho- 
logically meaningful. The median accept- 
ance for minority groups as a whole was 
somewhat beyond level 3 (“То my street as 
neighbors") while the median for the au- 
thority groups as a whole was at level б 
(“Аз visitors to my country"). Puerto 
Ricans were accepted "To my club as per- 
sonal chums,” while the Nazis and Ku Klux 
Klan would be excluded from the country. 
Negroes were the least accepted of the 
minority groups and Reactionaries were the 
most accepted of the authority groups. 
There were slightly more responses made 
to the minority than the authority items. 
The greatest amount of variability occurred 
for Reactionaries, which may have been due 
to the fact that many Ss did not know the 
meaning of this term. The least amount of 
variability occurred for the Negroes. Most 
of the Ss would neither accept nor reject 
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TABLE 2 


MEDIAN PREFERENCE AND RANK ORDER FOR MINORITY AND AUTHORITY ITEMS 
Ох THE SOCIAL DISTANCE SCALE* 


Social Distance Scale 


Minority Subscale 


Authority Subscale 


No. of No. of 
Item Responses | Md. Q Rank Item Responses | Md. Q Rank 
pee t -— 

Puerto Ricans 242 2.27 | 1.24 | 1 | Reactionaries 231 5.11 | 2.00 | 6 
Turks 237 | 2.74 | 1.47 | 2 | Totalitarians 232 | 6.12 | 1.08 | 7 
Mexicans 243 2.83 | 1.35 3 Fascists 241 6.46 363 8 
Hindus 241 3.44 | 1.80 4 Nazis 240 6.52 .69 | 9.5 
Negroes 240 4.06 | .57 5 Ku Klux Klan 238 6.52 | 1.58 | 9.5 
All items 1203. 3.30 | 1.31 АП items 1182 6.21 | 1.20 


* Medians interpreted as follows: 
1. To close kinship by marriage 
2. To my club as personal chums 
3. To my street as neighbors 


4. To employment in my occupation in my country 


Negroes beyond or below “To employment 
in my occupation in my country." The dis- 
tribution of scores obtained on the authority 
subscale was extremely skewed, correspond- 
ing to a “]” curve; scores for the minority 
subscale, however, were normally distrib- 
uted. The scale, although suitable for the 
minority items, apparently did not contain 
enough levels for the authority items at the 
rejection end. 

The closure scales were designed to meas- 
ure completion tendencies under closure 
conditions, 1.е., when an interrupted line is 
perceived as forming the outline of а single 
configuration. The combined systematic 
error for the group, shown in Table 3, was 
—4.43 mm., ога 17.0% underestimation of 
the size of the gaps. Completion tendencies 
on the Boy scale was 1.12 mm. greater than 
for the House scale. The difference is 
highly significant (t = 4.549). Ten Ss in the 
case of the house scale, and six Ss with 
regard to the boy scale did not reproduce 
the configurations. The reliability coeffi- 
cient for the house scale was .85; for the 
boy scale it was .90. 

One of the major purposes of the design 
scale was to test the assumption that closure 
tendencies occur only when the interrupted 


5. To citizenship in my country 
6. As visitors in my country 
7. Would exclude from my country 


outline is perceived as a single configur?" 
tion. Table 3 also indicates the results fot 
the design scale. The systematic error #0" 
the group as а whole was only .28 mm., 07 
1.196 overestimation. There is a 4.71 m^. 
difference between the combined mean ? 
the closure scales and the design scale mean 
which is statistically significant beyond ® 
1% level. This result is interpreted as SUP" 
porting the above assumption. 


TABLE 3 


MEANS AND SysTEMATIC ERRORS IN THE 
REPRODUCTIONS OF STANDARD OPENINGS 
IN THE PERCEPTION SCALES? 
(Standard opening: 26 mm.) 


Closure Scales К 
Desig" 
Scale 
Both House 
Item | House Boy and Boy 
e 
N 243 247 242 251 3 
м 22.13 | 21.01 | 21.57 26-00 
SD 4.48 | 4.36 4.04 2-28 
Еггог —3.87 | —4.99 —4.43 1 
error | 14.9 | 19.2 17.0 Ls 
* Errors are differences in mm. between means and standard 
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It was expected that the F scale would be 
consistently related to the minority and au- 


thority social distance subscales. A Pearson 
correlation coefficient of .32 (standard error 
.06) was obtained between the F scale and 
the minority subscale, significant at the 1% 
level of confidence. The correlation at first 
appears to be smaller than might be ex- 
pected since social attitudes are known to be 
highly generalized. The size of the corre- 
lation, however, becomes understandable if 
it is noted that the two scales measure social 
attitudes at two distinct levels (authoritari- 
anism at an implicit, central level and social 
distance at an explicit, peripheral level). 
Even more important, both measure oppo- 
site aspects of social attitudes; extreme 
“conservatism” is measured directly by the 
F scale, and extreme “liberalism” directly 
by the minority subscale. 

An unexpected negative correlation was 
obtained between the F scale and the au- 
thority social distance subscale. Examina- 
tion indicated a highly skewed distribution 
of scores for the authority subscale which 
prevented the direct application of a Pear- 
son coefficient of correlation. A contingency 
coefficient of —.23 was obtained. The cor- 
rected coefficient, comparable to a Pearson 
correlation, was —.33. . 

Table 4 indicates this negative relation- 
Ship between the F scale and the authority 
Subscale. The division of scores into low 
and high categories was made at the medians 
Of the respective scales. A chi square of 


ABLE 4 


DV ds LATIONSHIP BETWEEN THE CALIFORNIA 
ї SCALE AND THE AUTHORITY SUBSCALE 
OF THE Зостат, DISTANCE SCALE 
(Median division of Ss) 


Authority F Scale 
Subscale 
Both 
Low High Groups 
Acceptance 78 49 127 
po ection 44 71 115 
oth Groups 122 120 242 


TABLE 5 


THE RELATIONSHIP BETWEEN THE SUBSCALES 
OF THE SOCIAL DISTANCE SCALE 
(Median division of Ss) 


Minority Subscale 
Authority 
Subscale 
T Both 
Rejection | Acceptance | Groups 

Acceptance 42 85 127 
Rejection 67 48 115 
Both Groups 109 133 242 


12.955, significant at better than the 196 
level of confidence, indicated that there was 
a tendency for Ss scoring high on the F 
scale to reject authority groups to a greater 
extent than low scorers. 

The correlation between the two sub- 
scales of the social distance scales was 
found to be positive. It was implicitly as- 
sumed in this study that extreme liberals 
would accept ethnic minority groups and 
reject authority groups, and that conserva- 
tives would reverse this order of preference. 
А contingency coefficient of .25 or a cor- 
rected coefficient of .37 was obtained be- 
tween the subscales. The relationship is 
shown in Table 5 where the median point 
is again used to differentiate low and high 
scores. А chi square of 15.465, significant 
at better than the 1% level of confidence, 
indicates that Ss tend either to accept both 
minority and authority groups, or reject 
both. Social distance, therefore, appears to 
be generalized, i.e., independent of specific 
groups. 

d The Pearson correlation coefficient be- 
tween the two closure scales was .62 (stand- 
ard error, .04), which when corrected for 
71. The scores on 

les were combined into a single 
epee closure. The combined closure 
scores correlated — 01 with the scores on 
the design scale (standard error of the р 
relation .07), which was not ا‎ 
different from a zero correlation. The ко d 
correlation between the closure үс я 
the design scale indicates that the pro 


attenuation. becomes 
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involved in reproducing the size of gaps 
within a single configuration is different 
from those processes involved in reproduc- 
ing the size of gaps between nonunitary 
figures. 

It was initially intended that the selection 
of samples corresponding to extreme and 
intermediate social attitude categories would 
be determined on the basis of the F scale 
and both subscales of the social distance 
scale. Extreme liberals were originally de- 
fined as rejecting authoritarian beliefs, ac- 
cepting ethnic minorities, and rejecting au- 
thoritarian groups; extreme conservatives, 
as accepting authoritarian beliefs, rejecting 
ethnic minorities, and accepting authoritar- 
ian groups ; and the intermediate groups, as 
moderate with regard to authoritarian be- 
liefs and social distance. The social attitude 
categories of "low," “middle,” and “high” 
were determined on the basis of quartile 
divisions, although the "middle" group was 
the middle 2596 on all three scales. The 
middle quarter was defined as the combined 
upper half of the middle-low quarter and 
the lower half of the middle-high quarter. 
Eliminating the outer half of the two mid- 
dle quarters served the purpose of pro- 
ducing a distinct gap between the low, 
middle, and high categories. Using this 
sampling procedure, it turned out that of 
the 253 Ss only two Ss could be described 
as "low," six as “middle,” and one as 
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"high." This result was due to the un- 
anticipated negative correlation between P 
scale scores and authority subscale scores, 
and also to the positive correlation between 
the authority and minority subscales. It be- 
came necessary, therefore, to modify the 
original sampling procedure. 

Since the minority subscale correlated 
consistently with the F scale, the best alter- 
native was to determine social attitude cate- 
gories from these two, and to eliminate the 
authority subscale. This modification was 
not serious since it was the original purpose 
to obtain additional direct measures о 
liberality provided by the minority, rather 
than the authority, subscale. Utilizing ае" 
modified sampling procedure, а suffice 
number of Ss scored in the consistent 80012 
attitude categories for statistical purposes 
This sample, derived from the consisten" 
quarters of the F scale and minority SU 
scale, will be discussed subsequently. 


The Authoritarian Sample 


‘The numbers and percentages of 58 
scoring in each quarter of the Е scale 18 
shown in Table 6. Both the low and high 
categories each comprise approximate? 
24% of the population, and the middle 992" 
ter, 26.5% (not shown in Table 6). Th 
authoritarian sample, therefore, numberé 
183, or three-quarters of the entire popu 


lation. 


TABLE 6 


NUMBERS AND PERCENTAGES OF SS IN THE F SCALE AND MINORITY SUBSCALES 
(Division of Ss by quartile) 


F Scale 
Minority 
Subscale Low Low-middle | High-middle High All Group? 
N| a {к | 9% |N| о | N| |м % 
High (rejection) в | 33 | 2 | 49 м | 69 |2 | 9.0 | so | 25 
High-middle 10 4.1 | 13 5.3 | 16 6.5 | 14 5.7 | 53 к 
Low-middle 12 4.9 | 27 | мо | 17 6.9 | 11 4.5 | 67 69 
Low (acceptance) | 28 | 11.4 | 13 5.3 | 12 4.9 | 13 5.3 | 6 0 
All Groups 58 | 23.7 | 65 | 26.5 | 62 25.3 60 | 24.5 | 245 100, 


SIMILARITIES IN THE STRUCTURE 


The Social Distance Sample 


The numbers and percentages of Ss scor- 
ing in each quarter of the minority subscale 
is also shown in Table 6. There were ap- 
proximately 27% in the low category, 24% 
in the high category, and 26.1% in the mid- 
dle quarter (not shown in Table 6). The 
social distance sample numbered 189, or 
slightly more than three-quarters of the 


entire population. 


The Consistent Sample 

The consistent sample included 5$ scoring 
in the same quarters of the F scale and the 
minority subscale, shown in Table 6. There 
were 28 consistent low scorers (those scor- 
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ing in the low quarters on the two scales), 
or approximately 11% of the population. 
The 22 Ss in the high quarters, the con- 
sistent highs, comprise 9% of the popula- 
tion. The consistent middles (not shown in 
Table 6) numbered 19, or 7.8% of the popu- 
lation. There was a total of 69, or 27.396 
of the entire population, in the consistent 
sample. 

It was expected that there would be a 
larger percentage of Ss in the consistent 
high category than in the consistent low. 
This was not borne out, as Table 6 indi- 
cates. Actually, the reverse occurred al- 
though the difference between the percent- 
ages is not significant. 


TABLE 7 


Score LIMITS AND NUMBERS or Lows, MIDDLES, 


AND HiGHs IN THE SOCIAL ATTITUDE SAMPLES* 


Social Attitude Sample 
Social 
Attitude "T Consistent Sample 
Category Authoritarian Үт енш 
эше Sample E Minority 
Scale Subscale 
Low 
ET 22 3.00 2.2 
pue m 1.90 1.0 1.70 1.0 
Lower limit „71 =. м B 
N 58 22 
Middle 
Upper limit 3.93 HE e 3.8 
Lower limit 3.40 2. 1 2.8 
N 65 64 19 
High _ 
Upper limit 5.43 1-0 M 1.0 
Lower limit 4.23 4. : 4.4 
N 60 59 28 
All groups - " 
N 183 
pA 74.7 $1.41 27.3 


a Scores were interpreted as follows: 


. Strong opposition, disagreement 
Moderate opposition, disagreement 
Slight opposition, disagreement 
(Neither opposition nor een 
Jose kinship by marriage 
1 To ЖУ club as personal chums 


as neighbors "A 
есе i cupation in mv 


peus 


. To my 
To employment in my oct 


(Corresponding. qu: 
Computed on the basis of the entire population 


Pore 


of 253. 


country 
Consistent Sample uu 
arters of the F scale and minority subscale). 


Authoritarian Sample (F scale) 


5. Slight support, agreement 
6. Moderate support, agreement 
7. Strong support, agreement 


ial Distance Sample (Minority subscale) 


5. To citizenship in my country 
6. As visitors to my country © 
7. Would exclude from my country 
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Although the low, middle, and high social 
attitude categories in the authoritarian, 
social distance, and consistent samples are 
relative to the actual distribution of scores 
obtained, Table 7 shows a favorable corre- 
spondence between the ranges of scores for 
each category with the actual meanings of 
these scores. The categories of the con- 
sistent sample, derived from comparable 
quarters on the F scale and social distance 
scale, can be interpreted from Table 7 as 
follows: j 

The consistent low. Ss in this category exhibited 
moderate or slight opposition to authorization be- 
liefs on the F scale, and accepted ethnic minorities, 
on the minority subscale, to kinship by marriage 
and to club as personal chums. 

The consistent middle. Ss in this category ex- 


hibited slight opposition, or neither opposition nor 
support of authoritarian beliefs on the F scale, 
and accepted ethnic minorities, on the minority 
subscale, as neighbors and to employment in ones 
Occupation. н 

The consistent high. Ss т the consistent high 
category exhibited neither opposition nor support 
of authoritarian beliefs on the F scale, and their 
response to ethnic groups, on the minority sub- 
scale, ranged from acceptance in one’s occupation 
to exclusion from one’s country. 


PERCEPTION AND POPULATION SIMILARITIES 
IN EXTREME SOCIAL ATTITUDES 


Having thus established categories of 
social attitudes and a single perceptual clo- 
sure dimension, it was possible to test di. 
rectly the hypotheses of this study. Table 


summarizes the results for all three samples . 


TABLE 8 


DIFFERENCES IN MEAN PERCEPTUAL CLOSURE BETWEEN Lows, MIDDLES, 
AND HIGHS OF THE SOCIAL ATTITUDE SAMPLES 
(Standard opening: 26 mm.) 


Social Attitude Category Difference 
Sample ا ج‎ 
Low Middle High Low-High | Low-Middle High-Middl* 
na, MEE ae 
Authoritarian 
N 60 65 59 
M 21.77 22.36 20.62 
SD 3.20 3.35 5.40 
Error* —4.23 —3.64 —5.38 
% error 16.3 14.0 20.7 
Difference 1.15 .59 1.74 
t 1.400 .995 2.155* 
Social Distance 
N 64 59 56 
M 22.10 21.15 20.63 
SD 3.26 5.32 3.62 
Error“ —3.90 —4.85 —5.37 
% еггог 15.0 18.7 20.7 
Difference 1.47 .95 .52 
t 2.333% 1.196 ‚610 
Consistent 
N 26 18 20 
M 20.80 22.86 19.95 
SD 2.28 3.67 3.36 
Error? —5.20 —3.14 —6.05 
% error 20.0 12.1 23.3 
Difference .85 2.05 2 "TT 
i 1.000 2.232* 2. 
т 


* Significant at .05 level. 


а Differences between three place means and standard in all tables, 
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In the authoritarian sample, the perceptual 
closure means for lows, middles, and highs 
indicated a general tendency toward curvi- 
linearity. The highs exhibited 20.7% error 
in underestimation, th» lows 16.396, while 
the middles reduced the size of the openings 
by 14.0%. The highs, therefore, showed а 
greater closure trend than the lows, but the 
middles showed the least closure. It is im- 
portant to note that had the middles been 
excluded, the general appearance of the 
tendency would have fostered the more 
familiar interpretation of linearity. Only 
the difference between the means of the 
authoritarian middles and highs is signifi- 
cant. The obtained t of 2.155 is significant 
at the 596 level of confidence. 

The social distance sample revealed a 
linear relationship between the minority 
subscale and the closure scale as indicated 
in Table 8. The percentages of error in- 
crease with hostility toward the ethnic 
minorities. Only the difference in the per- 
ceptual closure means of the social distance 
lows and highs is significant (at the 5% 
level of confidence). If the study were 
limited to the minority subscale, it would 
appear necessary again to interpret these 
results in accordance with the general con- 
tention that lows are different from highs 
in perceptual closure reactions. Арт 

The consistent sample, however, as indi- 
cated in Table 8.showed an unmistakable 
curvilinear tendency. The percentages of 
error in underestimation for the consistent 
lows and highs were 20.0 and 23.3%, re- 
spectively, while for the consistent middles 
the reduction was only 12.1%. The extreme 
categories, therefore, exhibited a greater clo- 
sure tendency than the middle. 

Testing the difference between the means 
of the consistent lows and highs, a t of 1.000 
was obtained. Since the difference is not 
statistically significant, this confirms our 
t hypothesis which stated : 


o apparent difference in perceptual 
between individuals predisposed 
nservative social attitude and 
d extreme liberal social 


firs 

There will be n 
closure reaction 
toward extreme cO 
those predisposed отаг 


attitude. 
Testing the diff 
of the consistent 


erence between the means 
lows and middles, a t of 


2.232 was obtained. The difference is sta- 
tistically significant at the 5% level of con- 
fidence, thus confirming the second hypothe- 
sis which stated : 

Individuals predisposed toward extreme liberal 
social attitude will manifest greater perceptual 
closure reaction than those exhibiting moderate 
social attitude, 

In a similar manner, the difference be- 
tween the means of the consistent highs and 
middles was tested. The obtained # was 
2.476, also statistically significant at the 596 
level of confidence, therefore confirming the 
third hypothesis which stated : 

Individuals predisposed toward extreme conser- 
vative social attitude will manifest greater per- 
ceptual closure reaction than those exhibiting 
moderate social attitude. 

The closure tendencies indicated in Table 
8 are graphically presented in Figure 1, 
where a comparatively linear trend is ap- 
parent for the authoritarian and social dis- 
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Fic. 1. Perceptual closure and the social atti- 
tude samples. 


tance sample in contrast to the definite 
curvilinear relationship obtaining between 
perceptual closure for the consistent sample. 


‚ А comparison of the results of two pre- 
vious exploratory studies with those of the 
present investigation shows striking simi- 
larities, and adds additional support to our 
findings. In the fall of 1952, 89 New York 
City secondary school Ss were administered 
an F scale and perceptual closure scales 
similar in every important respect to the 
scales administered to the Houston sample. 
А social distance scale, however, was not 
employed. The only indication of social 
attitudes, therefore, was provided by the 
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TABLE 9 


CoMPARISON OF THE NEW Уовк AND Houston 
STUDIES WITH REGARD TO PERCEPTUAL 
CLOSURE AND AUTHORITARIANISM 


New York City Houston 
Study Study 
Item 
Per- Per- | 
ceptual | Authori- | ceptual | Authori- 
Closure? | tarianism | Closure? | tarianism 
N 86 89 242 253 
M 21.49 | 4.07 21.57 3.63 
SD 3.79 At 4.04 .82 
Error —4.51 —4.43 
% error 17.3 17.0 


a Standard opening: 26 mm. 


F scale. One of the hypotheses of this study 
was: "There will be a positive relationship 
between a Gestalt closure scale represent- 
ing perceptual closure and the California 
Fascism Scale representing cognitive clo- 
sure" (B. Taylor 1952, p. 4). Both scales 
were divided at the median, and the results 
indicated a significant tendency for the same 
individuals to be consistently high or low on 
the scales, thus confirming the hypothesis. 
The middles were excluded since the median 
was employed for establishing the high and 
low categories. 

Table 9 compares the F scale and the 
perceptual closure scale results of the New 
York City and the Houston studies. The 
number of 55 in the former was 89, three of 
whom did not complete the closure scales. 
There were 253 Ss in the Houston study 
with 242 completing this scale. Eliminees 
on the closure scales were 3 and 4%, respec- 
tively. Since the standard size of the con- 
figuration openings on the scales in both 
studies was the same (26 mm.), it can be 
seen that for both groups there was a highly 
significant tendency to underestimate the 
standard openings in the reproductions. The 
New York City group showed a 17.3% 
closure tendency, and the Houston group 
17.0%. Both groups, therefore, exhibited 
practically the same degree of closure. 


Comparison of the F scale means for the 
two groups in Table 9 indicates a difference 
which is significant at the 1% level of con- 
fidence (CR = 4.536). Whereas the mean 
of the New York City groups was арргох!- 
mately at the theoretical neutral point 
(neither agreement nor disagreement with 
the F scale items), the Houston group 
yielded a relatively low mean (disagree- 
ment with the items), which was probably 
due to the fact that college students tend to 
reject F scale items to a larger extent than 
do secondary school students. 


When perceptual closure means were 
computed for each quarter of the F scale 
for the New York City and Houston groups, 
as shown in Table 10, the two groups bear 
a strong resemblance with regard to closure 
tendencies. Closure decreases with rejec- 
tion of authoritarionism, although this te^ 
ency reverses itself for the low quarters. : 
is important to note that this reversal ° 


TABLE 10 


COMPARISON Or PERCEPTUAL CLOSURE FOR 55 w 

THE QUARTERS OF THE CALIFORNIA FASCISM SCi ae 

FOR THE New York City AND HOUSTON STUDI 
(Standard opening: 26 mm.) 


F Scale 
Study Quartile Division 
ў АП 
Low- | High- 5 
Low | middle | middle | High iac 
New York 
City 
N 22 22 21 |21 s 48 
M 22.08| 23.01| 20.68 20.07 3.79 
SD 4.31 3.94 3.25| 3.34 151 
Еггог — 3.92 | — 2.99 | — 5.32 — 5.93 773 
9 еггог | 15.1 11.5 20.5 | 22.8 мч 
Houston 
N 60 62 61 |59 тл 57 
M 21.77 | 22.12| 21.72 20.62 104 
5р 3.20 3.47 3.61 5.40 _443 
Еггог —4.23 | —3.88 | —4.28 —5.38 17.0 
% еггог | 16.3 14.9 16.5 | 20.7 
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lows occurs in both studies. This tendency 
is clearly shown in Figure 2. 


= -600| р 
t 4 
E 
g 
5 
3 500. 
2 
Е 
D 
= 
5 
g 
= 400] 
5 | 
E is Jl о—о——О HOUSTON GROUP 
> “Эм, / ө--ө--ө NEW YORK CITY 
a 
@ 200 se’ GROUP 
o dz TR Won 
000 тож MIDDLE 


МОЕ 
SOCIAL ATTITUDE CATEGORIES 


Fic. 2. Comparison of perceptual closure for 
the New York City and Houston groups by quar- 
ters on the California F Scale. 


ТЕ the division between low and high is 
established at the median for the two groups, 
rather than at quartile divisions, the curvi- 
linear trends are obscured, and the results 
support the linear interpretation as indicated 
in Table 11. The difference in the closure 


TABLE 11 


COMPARISON OF PERCEPTUAL CLOSURE FOR Ss 
ABOVE AND BELOW THE MEDIAN OF THE CALIFORNIA 
Fascism SCALE FOR THE NEW York City AND 
HOUSTON STUDIES 
(Standard opening: 26 mm.) 


F Scale 
Study Median Division 
س‎ 
Both 
Low High Groups 
New York City 
N 44 42 86 
M 22.55 | 20.38 | 21.49 
SD 4.16 3.23 3.79 
Error —3.46 | —5.63 | —4.51 
% error 13.3 21.7 17.3 
x 122 120 242 
M 21.95 21.18 21.57 
SD 3.35 4.61 4.04 
Error —4.05 | - 2 —4.43 
% error 15.6 18.5 17.0 
НЕ ЕЕ а 


means for the lows and highs in the New 
York City group is statistically significant 
at the 1% level of confidence (CR = 2.677). 
The difference in means for the highs and 
lows in the Houston group, however, is not 
statistically significant. Figure 3 shows these 
tendencies. 
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Fic. 3. Comparison of perceptual closure for 
the New York City and Houston groups above 
and below the median of the California F Scale. 


A second exploratory study was designed 
to test the similarity of extremes hypothesis 
in perceptual closure. In addition to a 12 
item F scale and a perceptual closure scale, 
21 secondary school Ss were administered a 
minority social distance scale, comparable to 
the one employed with the Houston group. 
When the consistent sample was extracted, 
and perceptual closure means computed for 
lows, middles, and highs, a curvilinear trend 
was obtained. Actually, the three Ss in the 
consistent low category were also the three 
manifesting the greatest degree of percep- 
tual closure. Since the consistent sample 
comprised only nine Ss with three in each 
category, statistical treatment was not feasi- 
ble. This exploratory study, however, sug- 
gested the desirability for more intensive 
investigation. 

The design scale was included to test the 
assumption that completion occurs, and is 
systematically related to social attitudes, 
only when the figure is perceived as a single 
configuration. The design scale was re- 
ported as having a zero correlation with the 
closure scales. It was also important to 
examine the relationship between the design 
scale and the social attitude scales. These 
results are presented in Table 12. The re- 
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TABLE 12 


DIFFERENCES IN DESIGN SCALE MEANS BETWEEN 
Lows, MIDDLES, AND HIGHS OF THE 
SOCIAL ATTITUDE SAMPLES 
(Standard opening: 26 mm.) 


Social Attitude Category 
Sample 
Low Middle High 

Authoritarian | 

N 61 67 61 

M 25.98 26.48 26.05 

SD 5.02 5:21 5422 

Error —.02 .48 .05 

% error .1 1.8 .2 
Social Distance 

N 66 63 59 

M 26.73 26.54 26.70 

SD 5.05 5.73 4.28 

Error NE .54 .70 

% error 2.8 2:1 2.7 
Consistent 

N 28 19 22 

M 26.56 | 25.27 25.74 

SD 4.67 4.19 5.43 

Error .56 —.73 —.26 

% error 2.2 2.8 1.0 


sults confirmed the prediction that there 
would be no consistent relationship between 
Social attitudes and the size of reproduced 
openings of the design scale. Examination 
of the computed perceptual closure means 
for the lows, middles, and highs for the 
three samples indicated that all the means 
did not deviate from the standard (26 mm.) 
in either direction by more than .73 mm. 
None of the differences between means 
reached anywhere near statistical signifi- 
cance. The perception of a single configura- 
tion, therefore, appears to be crucial with 
regard to the relationship between percep- 
tual closure and social attitudes. 

The results obtained regarding the rela- 
tionships between population characteristics 
and social attitudes also substantiate our 
findings thus far. Demographic characteris- 
tics were examined primarily for possible 
additional evidence of curvilinearity. The 


six population variables about which infor- 
mation was obtained were age, sex, religion, | 
parental income, education, and academic — 
major. 

Table 13 shows the mean age for lows, 
middles, and highs for the authoritarian, 
social distance, and consistent samples. For | 
all three samples, age is curvilinear, 1 - 
increases with intensity of attitude rather - 
than direction. In the authoritarian samp 6 | 
however, none of the differences are 5167 
nificant. In the social distance sample Шш | 
3.45 yrs. difference between highs and m i 
Чез is significant at the 5% level of con 
dence (CR = 2.189), and the difference d 
3.11 yrs. between lows and highs just o 
the 5% level of confidence (CR =. 1.90 15 
The difference between lows and middles ^ 
not significant. None of the differences 
tween age means for the consistent sam 
are significant. БЕ 

In comparing the distribution 0 
within the social attitude categories, 510“ 
in Table 14, a trend toward curvilinear 
is also apparent; in all three samples ther? 
were more women in the extreme social v | 
tude categories. In the authoritarian затр, E 
only the 23.3% difference between me? 


TABLE 13 | 


ND 
DIFFERENCES IN AGE FOR Lows, MIDDLES, А 


HIGHS IN THE SOCIAL ATTITUDE SAMPLES 
Social Attitude Categor? 
Sample | 
Low | Middle | Hig" 
SS 
Authoritarian 62 
N 60 67 7.31 
M 28.80 26.10 s 
SD 12.07 7.38 * 
Social Distance 5 
М 66 62 9.66 
M 26.55 26.21 20:04 
5р 9.09 8.11 á 
Consistent de 
N 28 19 31.14 
M 28.14 27.37 9,03 
SD 10.00 7.78 б 


| 
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COMPARISON OF NUMBERS AND PERCENTAGES OF 
MEN AND WOMEN IN THE Lows, MIDDLES, AND 
HIGHS IN THE SOCIAL ATTITUDE SAMPLES 


Men and 
Men Women | Women 
Sample 
N| % |N| % |N| % 
Authoritarian 
High 36| 59.0 | 25 | 41.0 | 61 100.0 
Middle 48 | 71.6 | 19 | 28.4 | 67 | 100.0 
Low 29 | 48.3 | 31 | 51.7 | 60 | 100.0 
Social Distance 
High 29 | 50.0 | 29 | 50.0 | 58 | 100.0 
Middle 42 | 66.7 | 21 | 33.3 | 63 | 100.0 
Low 41| 62.1 | 25 | 37.9 | 66 | 100.0 
Consistent 
High 12| 57.1| 9 | 42.9 | 21 | 100.0 
Middle 16| 34.2 | 3 | 15.8 | 19 | 100.0 
Low 15 | 53.6 | 13 | 46.4 | 28 100.0 


the lows and middles is significant, at the 
1% level (# = 2.728). In the social distance 
sample, only the 16.7% difference between 
men in the highs and middles is significant, 
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at the 5% level (? = 2.212). None of the 
differences between percentages for the соп- 
sistent sample are significant, although the 
30.6% difference for men in the low and 
middle groups just misses the 5% level of 
confidence (¢ = 1.823). Differences in the 
percentages for women, as in the case of 
men, are significant between authoritarian 
lows and middles (1% level, ? = 2.485) and 
between social distance highs and middles, 
(59% level, t = 2.212). 

Table 15 shows the distribution of Protes- 
tants, Catholics, and Jews among the cate- 
gories of the samples. There is no discerni- 
ble trend and none of the differences in any 
of the samples are significant. 

Table 16 compares the mean estimated 
yearly income of the parents of lows, mid- 
dles, and highs in the three samples. There 
is a noticeable curvilinear trend for all three 
samples; in each instance, a lower income 
mean is reported by extremes. In the au- 
thoritarian sample, however, none of the 
differences are significant. The trend for 
the social distance sample is clearly curvi- 
linear. Lows and highs are not significantly 
different, but the difference between lows 
and middles is significant at the 59 level of 
confidence (СВ. = 2.037), and the differ- 


TABLE 15 


COMPARISON OF NUMBERS AND PERC! 
IN THE Lows, MIDDLES, AND 


ENTAGES OF PROTESTANTS, CATHOLICS, AND Jews 
HIGHS оғ THE SOCIAL ATTITUDE SAMPLES 


Protestant Catholic Jewish All Groups 
Sample 
N % N % N % N % 
Authoritarian 
High 44 73.3 14 23.3 2 3.3 60 100.0 
Middle 56 84.8 10 15.2 — = 66 100.0 
Low 48 82.8 8 13.8 2 3.4 58 100.0 
e un 48 84.2 8 14.0 1 18 | 57 100.0 
Middle 51 81.0 12 19.0 = = 63 100.0 
er 46 73.0 15 23.8 2 5.2 63 100.0 
Consistent 17 80.9 3 14.3 1 4.8 2 100.0 
High 16 84.2 3 is | = E 19 100.0 
moti 21 30.8 4 15.4 1 3.8 26 100.0 
OW 
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TABLE 16 


YEARLY INCOME OF PARENTS FOR Lows, MIDDLES, 
AND HIGHS IN THE SOCIAL ATTITUDE SAMPLES 


Social Attitude Category 
Sample 
Low Middle High 
Authoritarian 
N 60 65 59 
M $5660 $5720 $5200 
SD 1920 1980 1860 
Social Distance 
N 66 62 52 
M $5440 $6300 $5340 
SD 2000 1780 1820 
Consistent 
N 28 19 19 
M $5860 $7080 $4240 
SD 2120 1620 1580 


ence between highs and middles is signifi- 
cant at the same level of confidence (CR = 
2.198). In the consistent sample, although 
the trend is curvilinear, the difference be- 
tween the lows and highs is significant at 
the 5% level (f = 2.209). Also, the differ- 
ence between the highs and middles is sig- 
nificant at the 1% level of confidence (¢ = 
4.075). 

The mean education level for the cate- 
gories on the social attitude scales for the 
three samples is presented in Table 17. The 
trend in both the authoritarian and social 
distance sample is linear, although the direc- 
tion of linearity is reversed. In the authori- 
tarian sample, the difference in mean educa- 
tion between lows and highs is significant 
at better than the 1% level (CR = 2.890) 
The difference between education means for 
the lows and middles is also significant, at 
the 5% level of confidence (CR = 2.002) 
None of the differences in either the ‘social 
distance or the consistent sample are sig- 
nificant. 

The final population characteristic was 
academic major. This was divided into two 
broad categories by five staff members of 
the Psychology Department at the Univer- 
sity of Houston for purposes of statistical 


treatment. Majors included in the first cate- 
gory were those described as essentially 
involving tangibles and were called “con- 
crete.” These included the natural sciences, 
biological sciences, and technologies. Majors 
included in the second category were those 
described as primarily concerned with in- 
tangibles, and were called “abstract.” These 
included the social sciences, liberal arts, and 
education. The relationship between aca- 
demic major judged “concrete” or “ab- 
stract" to the categories of the social atti- 
tude samples is shown in Table 18. There 
is а discernible tendency in all samples for 
the number of Ss with "concrete" majors 
to increase with conservative social attitudes 
and, conversely, for the number of Ss with 
"abstract" majors to increase with liberal 
social attitudes, although this tendency, us- 
ing the chi square test, 1s not significant. 
Thus far, the analysis has been confined 
to the authoritarian, social distance, and 
consistent samples with regard to the per- 
ception and population characteristics. Ex- 
amination of the "inconsistent" sample also 
proved fruitful. This sample was defined 
as the inversely related quarters of the Е 


TABLE 17 


EDUCATION LEVELS FOR Lows, MIDDLES, AND 
HIGHS IN THE SOCIAL ATTITUDE SAMPLES? 


Social Attitude Category 
Sample 
Low Middle High 
Authoritarian 
N 59 67 59 
M 3/18. 3.017 3.00 
5р 1.20 1.36 1.50 
Social Distance 
N 65 63 58 
M 3.15 3.27 3.47 
SD 1.47 1.30 1.41 
Consistent 
N 28 19 21 
M 3.46 3.68 3.62 
SD 1.24 1.03 1.45 


* Mean education levels were interpreted as follows: Fresh, 
man—1.00; Sophomore—2.00; Junior—3.00; Senior—40"* 
Graduate—5.00. 
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TABLE 18 


RELATIONSHIP BETWEEN 


THE NUMBER OF “CONCRETE” AND “ABSTRACT” 


ACADEMIC MAJORS TO THE SOCIAL ATTITUDE SCALES 
(Distribution of Ss: quartile division) 


F Scale Minority Subscale Consistent Quarter 
“Con- “Ab- “Con- “Ab- “Con- “Ab- 
crete” | stract" | Both crete” | stract" | Both | crete” | stract” | Both 
High 28 27 55 27 30 57 11 10 21 
High-middle 28 30 58 25 25 50 6 8 14 
Low-middle 29 34 63 29 31 60 10 16 26 
Low 20 39 59 20 42 62 7 21 28 
All categories 105 130 235 101 128 229 34 55 39 


scale and minority subscale. Table 6 indi- 
cates that there were eight inconsistent lows, 
ie, Ss falling in the low quarter of the F 
scale and the high quarter of the minority 
subscale, 13 inconsistent low-middles, 17 in- 
consistent high-middles, and 13 inconsistent 
highs, ie. Ss in both the high quarter on 
the F scale and the low quarter in the 
minority subscale. The inconsistent sample 
comprised a total of 51 Ss, or approxi- 
mately 20% of the entire population. 

The problem of inconsistency Was inter- 
esting for two reasons. First, analysis of 
the inconsistent sample is essentially a de- 
viant case analysis, 1.6., cases which are fre- 
quently discarded as “exceptions” ог 
“chance” variants. Second, the problem of 
inconsistency becomes meaningful if one 
assumes that the two scales measure social 
attitudes at two levels, the social distance 
scale at a relatively peripheral, overt OT 
explicit level, and the F scale at a more 
central, covert or implicit level. It would 
follow then that the inconsistent low scorer 
is implicitly liberal and explicitly conserva- 
tive and the inconsistent high scorer, 1m- 
plicitly conservative and explicitly liberal. 
This interpretation ig weakened, however, 

. (ha fact that low scores on the F scale 
by the fact t : cede 

ғ a reliable indication of 
are probably not a rele | шоку 
liberal tendency, 35 discussed Prev”. У. 
i г { the following analysis is to 

he purpose 0 | high and the in- 
Compare the inconsistent "IB 


consistent low with regard to the major 
variables of this study. 

The mean perceptual closure, age, paren- 
tal income, and education for the four in- 
consistent categories are presented in Table 
19. Although there is a tendency for incon- 
sistent extremes to manifest less closure 
tendencies than the middles, none of the dif- 
ferences are statistically significant. Table 
19 also shows that the mean age for the 
inconsistent highs is 23 yrs., which increases 
through the middle categories and reaches 
33 for the inconsistent lows. The mean 
span of 10 yrs. between the two extremes 
is significant at the 5% level of confidence 
(t = 2.584). In the case of parental income 
а curvilinear trend is apparent, although 
none of the differences are significant. Edu- 
cation level shows a linear trend similar to 
age, and, as in the case of the latter, the 
difference between the extremes is signifi- 
cant, at well beyond the 196 level of con- 
fidence (t = 4.527). That education and 
age varied similarly is probably due to the 
correlation between the two. 

Table 20 presents the numbers and per- 
centages of S's in the inconsistent categories 
in terms of sex and religion. There was a 
noticeable trend for the relative percentage 
of men to increase with inconsistent high 
social attitudes and for the relative per- 
centage of women to increase with incon- 
sistent low social attitudes. The small num- 
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moreover, were less reliable, as indicated by 
the wide variations in the reproductions of 
several openings in single geometric con- 
figurations. Relevant is the investigation by 
Goldstein and Scheerer (1941). Certain Ss 
described as concrete in behavior were un- 
able to reproduce meaningful figures from 
memory without changing them into more 
concrete ones. A line, circle, and a square, 
for example, were later reproduced by one 
S as windows ОЁ a church. It is possible 
that the presentation of relatively abstract 
figures to Ss tending toward concreteness 
of thought produces a disrupting effect upon 
their reproductions, thus rendering scores 
unreliable. This would reduce or destroy 
any relationship that may have been ob- 
tained between the reproductions and other 
measures, and may possibly explain why 
reproductions of opened, abstract figures, 
while exhibiting completion tendencies, did 
not systematically relate to social attitudes. 

The important finding of this study is 
that extremes manifested similar perceptual 
closure tendencies. It is necessary to empha- 
size, however, that this alone is not sufficient 
for generalizing. The basis for generaliza- 
tion lies mostly in the discernibly similar 
pattern of responses inherent in the be- 
havior of extremes, In one sense, therefore, 
this study may be considered only as a 
clear indication of this similarity. 

The interpretation of similarity of ех- 
tremes is strengthened by the additional 
curvilinear findings. Age increased with the 
intensity rather than the direction of an 
attitude. Women tended to be more intense, 
in either direction, than men in their social 
attitudes. Parental income also showed a 
curvilinear tendency, with extremes report- 
ing lower incomes than middles. Only some 
of these tendencies were found significant 
but, taken together, they support the con- 
tention of similarities. 

If it is contended that extremes are simi- 
lar, the next question arises as to why they 
are similar in the manner described. Why 
for example, did not the extremes tend to 
open, rather than close, the openings of the 
configuration? If one of the related char- 
acteristics of intense social attitudes is 
mental rigidity, then it would seem that Ss 
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scoring in the extreme quarters should 
carefully follow instructions in reproducing 
figures, thus increasing rather than decreas- 
ing their accuracy. It is important to realize, 
however, that mental rigidity involves en- 
during habits instilled prior to the experi- 
mental situation. If prior life-sets involve 
convictions that the outline of a boy ога 
house ought to be complete, then this set 
precedes and supersedes the instructions tO 
reproduce the figure accurately. Ss, on the 
other hand, that can be characterized as less 
intense can respond more accurately to the 
having less interferences from 
That extremes tend to cling to 
with the inevitable 
is known to be 
1 ex^ 


instructions, 
prior sets. 
what is most familiar 
consequence of distortion 
characteristic of Ss predisposed towarc 
treme social attitudes. » 

Another consideration involves clarifying 
the meaning of similarities and differences 
The attempt in the present research has 
been to explore similarities at the personality 
level as distinct from overt social behavior 
The demonstration of similarities at the 
personality level has not been interpreted 25 
nullifying the social differences in the be 
havior of extremes. Rather, an understand" 
ing of both the similarities and the differ" 
ences was considered more comprehensive 
than the prevalent tendency to emphasiz€ 
only differences. The problem is not whether 
extreme liberals and extreme conservatives 
are similar or different, but, in what way 
are they similar, and in what way different! 
The present research has attempted to an 
swer this question by suggesting that e 
tremes tend to be genotypically similar an 
phenotypically different. 


| 
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А final consideration with regard to sim 


larities is concerned with the middle group 
То the extent that the extremes can be сой 
bined at a personality level, the rel 
obscured middle group also becomes 
for rescarch in social attitudes. as 
One of the unanticipated findings © 
the positive correlation between the § in 
scales of the social distance scale, whic yh 
dicated that Ss tended either to accept “ect 
minority and authority groups or to re pat 
both. This was surprising, considering ^ 4s 
included in the former were such grouP 
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Negroes and Mexicans, and, in the latter, 
Ku Klux Klan and Nazis. This finding, if 
substantiated by further research, indicates 
the importance of what may be described as 
personal distance, a general tendency to re- 
Spond to all groups in a relatively similar 
manner, rather than the more familiar con- 
ception of social distance, implying a tend- 
ency to respond in widely different manners 
to specific groups. This tentative interpreta- 
tion is supported by the finding of Hartley 
(1946), who, in discussing ^... the gen- 
eralized nature of tolerance" states, “Ар- 
parently there is evidence to suggest that 
degree of tolerance expressed by individuals 
is a generalized function of the individual 
and is not completely determined by the 
specific group toward which the attitude is 
directed” (p. 25). 

A similar finding was also reported by 
Murphy and Likert (1938) in a study in 
which a high correlation was obtained be- 
tween two randomly chosen groups for 
whom Ss had previously expressed prefer- 
ences. In the present study, the results sup- 
port the interpretation of personal distance 
to an even greater extent, since the groups 
correlated were not random, but represented 
clearly opposed groups. 

There is an important similarity between 
perceptual closure and personal distance 
which supports one of the major assump- 
tions underlying this research. Extreme 
liberalism and extreme conservatism can be 
described as manifestations of the same 
underlying intensity which was character- 
ized by perceptual closure. Authority and 
minority social distance, similarly, can be 
described as manifestations of the same 
underlying degree of acceptance which was 
characterized by personal distance. Both 
perceptual closure and personal distance, 
therefore, can be interpreted as genotypic 
constructs with opposed phenotypes. 

The analysis of deviant cases—Ss in the 
inversely related quarters of the F scale and 
minority subscale—revealed а coherent pat- 
tern of differences between the “inconsist- 
ent” low who can be described as rejecting 
both authoritarian beliefs and bec гы 
norities and the “inconsistent high who 


itari i a thnic 
accepted authoritarian beliefs and e 
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minorities. The “inconsistent” lows were 
older, at a higher level of education, and 
predominately women. The “inconsistent” 
highs, conversely, were younger, at a lower 
level of education, and predominately men. 
With the interpretation that the F scale 
measures the implicit level of an attitude, 
and that the minority subscale measures the 
explicit level, there no longer appears to be 
a true inconsistency, but rather differences 
in the organization of an attitude. This 
tentative formulation suggests a dynamic 
typology involving consistent implicit and 
explicit and “inconsistent” implicit and ex- 
plicit social attitudes. This classification 
would appear to be more realistic than the 
familiar linear ones, since it takes into con- 
sideration the important observation that 
individuals can exhibit various degrees of 
social attitudes ranging from extreme liber- 
ality to extreme conservatism depending 
upon, among other things, the level of or- 
ganization of the attitude involved. 


SUMMARY AND CONCLUSIONS 


It was the purpose of this research to 
compare perceptual closure tendencies of 
extreme liberals, extreme conservatives, and 
intermediate groups. Extreme liberals were 
defined as those expressing relatively strong 
disagreement with the authoritarian beliefs 
of the F scale and accepting ethnic minori- 
ties. Extreme conservatives were defined as 
those expressing relatively strong agreement 
with the items of the F scale and rejecting 
ethnic minorities. It was hypothesized that 
Ss exhibiting extreme social attitudes would 
demonstrate a similar tendency to prema- 


turely close openings in reproducing con- 
Тє was further 


figurations containing gaps. 

hypothesized that either extreme would ex- 
hibit a greater degree of closure than the 
middle group. These hypotheses were based 
on observations of basic similarities dis- 
cernible in the behavior patterns of extremes 
in social situations. 

A review of the litera 
most studies have tended to emphasize ba: 
differences between extremes. Examination 
of the California scales upon which many of 
these results were largely based reveals that 


ture indicated that 
basic 


se . 


low scores do not necessarily reflect liber- 
ality since the scale items are expressed in 
authoritarian terms. Many of the differ- 
ences reported in the literature, therefore, 
may be due to this inherent weakness in the 
California scales. 

The materials used in the present study 
included the F scale, a social distance scale, 
containing a minority and an authority sub- 
scale, two perceptual closure scales, and a 
design scale. Three social attitude samples 
were derived from these scales. The au- 
thority sample consisted of the low, middle, 
and high scorers of the F scale. The social 
distance sample consisted of the low, mid- 
dle, and high scorers of the minority sub- 
scale. The consistent sample, the most im- 
portant in this research, was comprised of 
those Ss in the related quarters of the F 
scale and the minority subscale. This latter 
sample, although small in number, was con- 
sidered to provide a more reliable estimate 
of social attitudes. The two closure scales 
correlated .71 and were combined into a 
single scale. The design scale, which con- 
tained gaps between several configurations, 
demonstrated that completion tendencies 
occur only when the configuration is per- 
ceived as a whole. The population consisted 
of 253 students attending courses at the 
University of Houston. 

The correlation between the F scale and 
the minority subscale was .32. An unex- 
pected negative relationship of —.33 be- 
tween the F scale and the authority subscale 
indicated that Ss accepting authoritarian 
beliefs rejected authoritarian groups, and 
Ss Tejecting authoritarian beliefs accepted 
authoritarian groups. An unexpected cor- 
relation of .37 between the subscales of the 
social distance scale indicated that Ss tended 
either to accept both authority and minority 
groups or to reject both. 

Comparison of perceptual closure means 
for the categories of the social attitude 
samples revealed a significant curvilinear 
relationship for the consistent sample only, 
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thus confirming the hypotheses of the study. 
This result was supported by the findings of 
previous exploratory investigations which 
revealed tendencies toward curvilinearity- 
It was also supported by the additional find- 
ings that curvilinearity was apparent for 
age, which increased with attitude intensity ; 
for sex, where a preponderance of women 
exhibited intense attitudes, while most men 
were intermediate; and for parental in- 
come, which decreased with attitude inten- 
sity. Responses on the design scale were 
not systematically related to social attitudes. 

An analysis of the deviant cases, de- 
scribed as the "inconsistent" sample, in- 
cluded those who scored lowest on the 
scale and highest on the minority subscale, 
and those scoring in the reverse order. The 
"inconsistent" lows were distinguishable 
from the “inconsistent” highs in age, educa" 
tion, and, to a certain extent, sex. 

The results obtained in the present 16 
search suggested four major conclusions. 
First, there was the finding that extreme? 
were similar with regard to perceptual clo 
sure. This was interpreted as supportin 
the contention that the structure of socia 
attitudes of extreme liberals and extreme 
conservatives has basic similarities. Second, 
there was the finding that scores on the 
apparently opposed subscales of the soci4 
distance scale were positively related. Th! 
was also interpreted as indicating a bas!© 
structural similarity on the personality leve 
Both perceptual closure and personal dis- 
tance were interpreted as genotypes whic 
reveal themselves in apparently opposing 
phenotypes. Third, the finding of a cohere” 
pattern of “inconsistent” cases with regar 
to the relationship between the F scale ME 
minority subscale indicated the important 
of viewing attitudes as levels of implicit a? 
explicit organizations. Finally, the gene™ 
results indicated the feasibility of utilizing 
the consistent sampling technique for e 
ther investigating the structure of soc! 
attitudes. 
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APPENDIX A 


THE SOCIAL ATTITUDE SCALES 
THE CALIFORNIA FASCISM SCALE 


following statements refer to opinions regarding a number of social issues, about which some 


people agree and others disagree. Please mark each statement in the left-hand margin according to your 
agreement or disagreement, as follows: 


+1 slight support, agreement —1 slight opposition, disagreement 
+2 moderate support, agreement —2 moderate opposition, disagreement 
+3 strong support, agreement —3 strong opposition, disagreement 


—ÀáÁ 
НЕЕ. 


Пе 


— 11. 


. Nowadays w 


. Young peopl 


- What this country 


. Sex crimes, such 


Obedience and respect for authority are the most important virtues children should learn. 
No weakness or difficulty can hold us back i£ we have enough will power. 


Science has its place, but there are many important things that can never possibly be under- 
stood by the human mind. 


4. Human nature being what it is, there will always be war and conflict. 
. Every person should have complete faith in some supernatural power whose decisions he 


obeys without question. 


. When a person has a problem or worry, it is best for him not to think about it, but to keep 


busy with more cheerful things. 


. A person who has bad manners, habits, and breeding can hardly expect to get along with 


decent people. 


. What the youth needs most is strict discipline, rugged determination, and the will to work 


and fight for family and country. 


. Some people are born with an urge to jump from high places. 


hen so many different kinds of people move around and mix together so much, 


a person has to protect himself especially carefully against catching an infection or disease 


from them. 
An insult to our honor should always be punished. 
e sometimes get rebellious ideas, but as they grow up they ought to get over 


them and settle down. 


. It is best to use some pre-war authorities in Germany to keep order and prevent chaos. 


needs most, more than laws and political programs, is a few courageous, 
tireless, devoted leaders, in whom the people can put their faith. 

as rape and attacks on children, deserve more than mere imprisonment; 
such criminals ought to be publicly whipped, or worse. 


- People can be divided into two distinct classes : the weak and the strong. 
. There is hardly anything lower than a person who does not feel a great love, gratitude, and 


respect for his parents. 
Some day it will probably be shown that astrology can explain a lot of things. 


Nowadays more and more people are prying into matters that should remain personal and 
private. 


‚ Wars and social troubles may someday be ended by an earthquake or flood that will destroy 


the whole world. 


Most of our social problems would be solved if we could somehow get rid of the immoral, 


crooked, and feebleminded people. 
The wild sex life of the old Greeks and Romans was tame compared to some of the goings-on 


` dn this country, even in places where people might least expect it. 


Tf people would talk less and work more, everybody would be better off. 
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24. Most people don't realize how much our lives are controlled by plots hatched in secret places. 


—5. Homosexuals are hardly better than criminals and ought to be severely punished. 


— 26. The business man and the manufacturer are much more important to Society than the artist 
and the professor. 


27. No sane, normal, decent person could ever think of hurting a close friend or relative, 
— ——28. Familiarity breeds contempt. 
29. Nobody ever learned anything really important except through suffering. 


—— 30. America is getting so far from the true American way of life that force may be necessary 
to restore it. 


THE SOCIAL DISTANCE SCALE 


On this blank are listed the names of a number of different groups and descriptions of six ai. 
ent social relationships. For each group please check al! the social relationships to which you Mess 
willing to admit members of that group. If you would prefer not to admit members of а par En 
group to any of these six relationships, put a check mark opposite the name of that group in Colt 
(Would exclude from my country). 


| 
| 
| 
| 


1 2 3 4 5 6 7 
To close To my To my ast B do To citi- „Аз beer 
kinship club as street EC re Puer Tom ше 
m. n pe nei рока country country country country 
Fascists Em = we emt —— — ae = 
Hindus We ME. -— ‚ы — — ټك‎ 
Puerto Ricans m DEIN oe — سڪ‎ a کے‎ 
Reactionaries YS = 
Mexicans у жайы UMEN — 
Nazis hos. Ч 
Turks =s KERA ت‎ 
Negroes e к 
`Ки Klux Klan cr NE Е —— == 
Totalitarians + 4 mu 
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APPENDIX B 


THE PERCEPTUAL CLOSURE SCALES 


(Mean openings of original: 26 mm.) 
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THE DESIGN SCALE 


(Mean openings between configurations : 26 mm.) 


COPY THE REST OF THE FIGURES BELOW 
JUST AS YOU SEE THEM ABOVE 
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PERCEPTUAL CONDITIONS OF ASSOCIATION”? 


SOLOMON Е. ASCH, JOHN CERASO, anp WALTER HEIMER 
Swarthmore College? 


Te is an investigation of the role of 
the perceptual characteristics of the 
stimulus in the formation of associations. 

It would be natural to suppose that the 
dependence of association upon the prop- 
erties of the stimulus constitutes a central 
problem, given that what is associated cor- 
responds, often quite faithfully, to what has 
been perceived. For reasons that reach far 
into the history of thinking in our field, this 
has not been the case; the nature of the 
stimulus has largely been taken for granted. 
Nowhere is this more evident than in the 
traditional theories and experimental studies 
of association. Investigation carefully con- 
trols such conditions as contiguity, fre- 
quency, temporal relations, or the circum- 
stances surrounding the appearance of the 
stimulus. But the properties of the stimulus 
itself are as a rule not systematically 
treated; indeed, often they are described 
merely by naming or pointing. We propose 
to show that when one takes the structure 
of the stimulus into account, new and ne- 
pecten questions arise concerning the rela- 
P idus речени to association and con- 

Sin id t is nature of association. . 
четци rely for evidence concerning the 
berger of associations on the data of re- 

‚ We shall begin with the latter. Recall 
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is the end product of a sequence of events. 
The distal stimulus, the proximal stimulus, 
the resulting perceptual process, the after- 
effects it produces, and their subsequent re- 
activation in the form of recall, are a series 
of consecutive events each of which depends 
directly on those preceding. Many of the 
formulations concerning associations are 
statements of relation between the data of 
recall and these earlier events. Ideally, each 
of the successive steps just mentioned 
should find a place in the study of associa- 
tions. Practice falls far short of this proce- 
dure. Before introducing the problem of 
this investigation, it will be helpful to see 
how we generally go about establishing a 
relation between recall and its conditions. 

For illustration we choose the kind of re- 
call termed associative. The basic observa- 
tion is that the experience of a is followed 
by the recall of b. We consider this an in- 
stance of recall because we relate it to an 
earlier occasion when the corresponding 
stimuli, A and B, were given and observed. 
The correspondence between what is re- 
called and what was originally experienced 
forces us to conclude that the latter pro- 
duced an aftereffect that persists and can be 
reactivated. It is on this ground that we 
distinguish between the formation and recall 
of associations. Further, we bring into rela- 
tion the data of recall to the particular con- 
ditions prevailing at the time of original 
A and B were spatially 
contiguous, We relate the obtained recall to 
the condition of contiguity ; similarly, if A 
and B appeared in a particular temporal 
succession, or with a given frequency. 

In the instance just cited, it is usual to 
refer interchangeably to the preceding dis- 
tal, proximal, and perceptual conditions. 
This ambiguity is harmless when, as is often 
the case, the correspondence between these 
respective events is 50 lawful as to pose no 


experience. Thus, if 
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problem of distinction. As a rule, when A 
and B are contiguous, the proximal stimuli 
are also contiguous, and so are the phe- 
nomenal a and b. But the structural descrip- 
tion of the stimuli is often equally informal. 
This practice slights the advantages inherent 
in an exact specification of the stimulus, 
advantages that are particularly important 
when the stimulus is perceptual. 


Tue PROBLEM 


When we speak of the formation of an 
association we refer, then, to a functional 
connection between two experienced con- 
tents, an a and a b, corresponding to the 
objectively given A and B. That an asso- 
ciation presupposes iwo terms is so obvious 
a requirement that it has not provoked 
further inquiry. We shall nevertheless see 
that this necessary but unexamined condi- 
tion conceals ambiguities and problems of 
consequence about the role of perception in 
association. 

An examination of the classical associa- 
tion experiment as a perceptual situation 
will serve to introduce the present problem. 
The standard procedure is to introduce the 
learner to a pair of stimuli, A and В, paired- 
ness being established by means of distinc- 
tive contiguity—spatial, temporal, or spatio- 
temporal. Further, the stimuli are as a rule 
discrete and heterogeneous. 

The latter property of the association ex- 
periment brings us to a surprising observa- 
tion. Investigation in this region has been 
exclusively concerned with a particular kind 
of relation, namely, with stimuli each of 
which is a distinct perceptual or cognitive 
unit. Indeed, this property has become the 
criterion for the identification of an associa- 
tion. Investigation has confined itself to as- 
sociations formed between one unit and 
another. The outstanding feature of this re- 
lation is that the units are (or approximate 
to) a mere assemblage. Each has a separate 
existence; take one away and little happens 
to the other. 

This practice singles out a stimulus rela- 
tion that is, to be sure, frequent in experi- 
ence and important for theory. But it is 
neither the sole nor perhaps the representa- 


tive case. There are other stimulus rela- 
tions that are equally relevant to the prob- 
lems in this region. Most challenging are 
those instances in which the respective terms | 
belong to the same well-structured unit. | 
Relations of the latter kind have not been 
investigated in connection with the problems 
of association, probably because the special 
character of the prevailing procedure was | 
not suspected. At the same time, it seems 
justified to say that the various associative 
interpretations were intended to apply to 
most other instances of functional connec- 
tion. 

The thought from which we start may be 
restated as follows. The objective relations 
between ап A and B may be of different 
kinds; the corresponding perceived re ations 
may be decisive for the functional intimacy 
between the terms in question (and there 
fore for retention and recall). At the one 


extreme, an a may be so completely | 
sorbed in a larger unit b that it loses PSY 
chological identity. This is the case that 
Gottschaldt (1926) has investigated in the | 
region of perception; one can then no onge" 
speak of association. At the other extreme 
a and b may be discrete units; this is the 
case that investigation has favored under 
the rubric of association. There are also 11” 
termediate cases, with some of which WÊ 
will be concerned, in which a and b belong 
to a single, tightly organized unit while re“ 
taining their respective identities. 

The last-mentioned relation raises the fol- 
lowing question. Most units consist of рап, 
more generally, they possess more than one 
property, and more than one aspect. 
recall of such a unit involves the recall к. 
its several parts or properties, as well 2 
their order. The question of the function? 


intimacy between the parts of a unit 
stint 


surely as pertinent as that between di 
units. There are, in short, operations nece, 
sary to the perception and recall of а a 
that are formally analogous to the реге 
tion and recall of heterogencous unit р? ne 
Tn the light of this distinction, we ask je 
following questions. Are there observ sr i 
differences between the association of 16 п 
ogeneous perceptual units and the r dy 
tional connections formed within a © ~ 
у 
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structured perceptual unit? There is also a 
more fundamental question : whether the re- 
spective functional connections are qualita- 
tively the same, or whether they involve a 
difference of process. 

In the investigation that follows we have 
studied the functional connections estab- 
lished between stimuli related in a number 
of different ways. Our particular aim has 
been to compare the functional intimacy be- 
tween heterogeneous units and between the 
interdependent parts of clearly structured 
units. The rationale of the experimental 
procedures follows directly from the prob- 
lems just stated. We have studied the simi- 
larities and differences between stimulus 
conditions exemplifying different relations, 
but as far as possible identical in content. 
We have attempted in each case to specify 
independently the objective and the phe- 
nomenal properties of the stimulus. Since 
there are different particular sources of 
unity and nonunity, a range of conditions 
was studied. 


Tur CONSTITUTIVE RELATION 


For the first step of this investigation we 
chose perceptual stimuli bound by the rela- 
tion we will call constitutive. For this pur- 
pose we turned to the following perceptual 
datum, A visual contour possesses at mini- 
mum two properties: it delineates a shape, 
but it also has a concrete material character ; 
the contour line may be thick or thin, con- 
tinuous or discontinuous, dashed or dotted. 
The same form, such as the circle in Fig- 
ure 1 below, may thus be represented in dif- 
ferent ways. Let us call the one property of 
a contour line its form (or Е), the other its 
mode (or M), 
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Fic. l. Form and mode. 


Concerning these properties of a visual 
contour we may say the following. First, 
m and mode are inseparable: a given 
or another mode. 
ably united : 


for 
form must appear in one ог an 
But second, they are not invari 


a given form may appear in a number of 
modes. Third, the mode of a contour also 
has form quality: it is a form in its own 
right. Fourth, the modal form in turn may 
possess a mode. Fifth, the final modal prop- 
erty of a visual stimulus has a mode which 
is continuous. The dot and the straight line 
are instances of ultimate modes. 

The relation between form and mode may 
be characterized as follows. 1. Each is a 
perceptual unit, the mode as much as the 
form, but they stand in a particular hier- 
archical relation, The spatial distribution of 
the modes generates a particular shape; 
thus, the shape is dependent on the modes 
as its constituents. The relation of depend- 
ence is such that neither term can retain its 
character without the other. 2. As a neces- 
sary corollary, form and modes are identical 
in locus; the form is where the modes are. 
3. The modes generate a shape by virtue of 
their position, not through their particular 
form character. Or, they delineate a shape 
in terms of their point quality alone. It fol- 
lows that the relation of the modal form to 
the encompassing form is not identical with 
the category of part—whole relation. 4. 
Finally, form and mode are clearly dis- 
criminable. 

Concerning stimuli of this structure one 
can ask: how strongly connected do the as- 
pects here singled out become under given 
conditions? When a subject (S) has in- 
spected one of the figures above and at- 
tempts later to recall it, we can establish 
whether he has recalled one, or both, or 
neither of the contour properties. In short, 
we can determine the coherence or jointness 
of the given properties in recall, or their 
functional intimacy.* If, now, we could 
separate these properties and constitute 
them into distinct units, we would have suc- 
ceeded in establishing a parallel condition, 
conforming to the traditional associative 
paradigm, to be compared with the preced- 
ing. This we propose to do in the following 
manner. 


*The stimuli in question possess, to be sure, 
other properties as well—such as size, orientation, 
position in space—but we do not consider them in 
this investigation, 
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Procedure 
The Stimulus Conditions 


The stimuli selected for the first experiments are 
reproduced in Figure 2. Those at the leít consist 
of different forms whose contours also differ in 
mode. At the right the same forms are all drawn 
continuously, but each is paired with a line con- 
taining the modal components of the corresponding 
figure at the left. 

The difference between the two sets of stimuli 
may be described in terms of the relations they 
exhibit. In the first set, the relation of mode and 
form is constitutive. The second set of stimuli con- 
sists of the same contents, but these are now 
spatially separated. Separation in this case destroys 
the constitutive relation and abolishes the struc- 
tural interdependence of the contents. These now 
form a pair of heterogeneous items, within which 
each has the same role or rank. It should be noted 
that form and mode were chosen to be hetero- 
geneous, and that they were equally heterogeneous 
in the two series. 

We can also characterize the two series in terms 
of their unity. By virtue of the constitutive rela- 
tion, form and mode of the first series belong to 
the same unit; the corresponding contents of the 
second series are dual. The basis for this distinc- 
tion is phenomenal; observers will agree that the 
two series differ in unity. Accordingly, we will 
call the first stimulus set unitary (or U) and the 
second set nonunitary (or non-U). This designa- 
tion is intended at this point to be only descriptive. 

It may be helpful to state in what respects these 
stimulus conditions do not differ. (a) Since form 
and mode are heterogeneous, and equally so in 
both conditions, the terms “unitary” and “поп- 
unitary” refer not to the number of stimulus prop- 
erties, but to the perceived relations between them. 
With respect to content, both sets of conditions 
were equally dual. (b) It follows that numerical 
singleness and duality were not the essential differ- 
ences. A nonunitary pair was such not because it 
consisted of separated units, but rather because the 
units were heterogeneous. Distinct units can well 
function as parts of an encompassing unit; again 
the relation is decisive. 

The U series consisted of 10 stimuli, the non-U 
series of 10 pairs of stimuli, Thus, form and mode 
correspond to the A and B terms of the traditional 
association experiment, 

The forms and modes were selected so as to be 
relatively simple and easily reproducible, We at- 
tempted to avoid the combination of forms and 
modes that might be similar or demand each other 
for figural reasons. (For a check on this require- 
ment see pp. 10-11.) 


5 Although the distinction between form and 
mode applies only to the unitary stimuli, we will, 
for the sake of convenience, designate similarly the 
corresponding parts of the nonunitary stimuli, 


The stimuli were drawn in black pencil on a 
white card (4 in. X 6 in.), The figures occupied, 
in both conditions, the center of the left-hand side 
of the card. The line portions of the non-U stimuli 
were centered on the right-hand side of the card, 
and were 13 in. long. The distance between figure 
and line of the latter stimuli varied from 13 to 2 
in., depending on the shape of the former. 

The contour line of a given U stimulus consisted 
of a sequence of identical modal forms. Six stimuli 
had discontinuous contours, four were drawn in 
continuous contour. The orientation of the modes 
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was kept constant, or independent of the changing 
orientation of the contour line, in all but two cases 
(stimuli 6 and 9). 

The lines of the non-U stimuli were all drawn 
horizontally. This involved a change of orientation 
of the modes in the transition from the U to non-U 
series in three cases (stimuli 6, 8, and 9). 


Conditions of Learning 


Experimentation was done individually. The 
cards on which the stimuli appeared were placed in 
iront of S against a slightly inclined wooden sup- 
port stand. Each stimulus was shown singly for a 
period of 4 sec.; timing was done with a metro- 
nome. Each card was placed face down at the end 
of 4 sec., immediately exposing the next card. The 
stimuli were shown in the order in which they are 
numbered in Figure 2. 

The conditions of learning were identical in each 
of the following variations. Ss were instructed to 
inspect the figures with care so that they could 
later reproduce them exactly. All Ss were thus 
directed to anticipate a free recall test. Particular 
care was taken to instruct Ss in the non-U condi- 
tion that they were to reproduce the pairs of 
stimuli on each card. It was also made clear to all 
Ss that it was not necessary to recall Ше order in 
which the stimuli appeared. 

"The instructions were as follows : 

Learning instructions: Unitary . 

This is a simple learning experiment. I am 
going to show you in succession а number of 
figures on the following cards. . 

After I have shown you the cards once, I will 
üsk you to remember and draw the figures that 
you have seen. Look at each card carefully, so 
that you may be able to recall as many of the 
figures as possible. You will try to make your 
drawings look as much like the original as 
Possible, 

Each card will be shown once. Do not try to 
memorize the sequence in which the cards are 
Shown. Any questions? 


Hearing instructions: Nonunitary 

aos 15 а simple learning experiment. I am 
E © Show you in succession a number о 
: Pr card y two drawings on it—at the left 
15 а Пбиге, and at the right is a line drawn in a 
particular way. 

After I have shown you the cards once, I will 
ask you to recall and draw the figure and the 
line that appeared on each card. Look at the 
drawings carefully so that you may be able to 
recall as many of the pairs as possible, and to 
them exactly as they appeared in the 
original. Please note that the task is to remem- 
1 я d к pairs of drawings on each card. You will 
va op rings look as much like the 
trv to make your drawings 
original as possible. 


draw 


Each card will be shown once. Do not iry to 
memorize the sequence in which the cards are 
shown. Any questions? 


Conditions of Recall Б 


Recall was tested 3 min. following the exposure 
of the last stimulus. The first 2 min. of this inter- 
val were occupied with a side task, consisting of 
the solution of anagrams. The Ss were then inter- 
rupted and given the recall instructions. 

There were three conditions of recall (with com- 
parable groups), both in the unitary and nonunitary 
conditions. 

Experiment 1: Free recall. Ss were provided 
with blank booklets on which they reproduced all 
the stimuli they could recall. They were instructed 
to draw the contents of one stimulus card to a 
page, then to turn the page back, and were warned 
not to return to it subsequently. No strict time 
limit was imposed; the experiment came to an end 
when it became evident that S could make no fur- 
ther progress. The U Ss were urged to reproduce 
either the form or mode alone when they could not 
recall a stimulus completely, and to guess when 
they were not certain. In the non-U condition they 
were asked to reproduce one of a pair when they 
could not recall its partner. 

We reproduce below the recall instructions: 

Free recall instructions: Unitary 

Will you now draw, in this booklet, the fig- 
ures exactly as they appeared earlier. Draw one 
figure to a page and then fold it under, You are 
{тее to recall the figures in any order, 

You will have noticed that each figure was 
drawn with a different contour. If you remem- 
ber only the figure, but not the form of the con- 
tour, just draw the figure in continuous outline. 
Similarly, if you remember only the form of the 
contour, but not the figure with which it went, 
draw a few samples of the contour. If you are 
not sure, try to guess. 

Free recall instructions: Nonunitary 

Will you now draw, in this booklet, the pairs 
of figures and lines exactly as they appeared 
earlier. Draw one pair to the page, and then fold 
it under. You are free to recall the pairs in any 
order. 

If you remember one of the pairs only, draw 
it; if you are not sure of the other member of 
the pair, try to guess, 

Experiment 2: Recall of mode. The booklets 
carried exact reproductions of the forms, all drawn 
continuously, one to a page. In the U condition the 
task was to draw the same figure with its original 
mode; in the non-U condition the task was to 
reproduce the previously paired line. Except for 


9 Ss were allowed to draw only a few samples 
of the mode when they showed impatience about 
going ahead. 
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the necessary changes the recall instructions were 
patterned after those reproduced above. 

The order of the forms in the recall test was dii- 
ferent from the presentation order, but constant for 
the entire group. Again, no exact time limit was 
imposed; as a rule Ss did not stay unduly long 
with any stimulus. Once they passed it, they were 
not allowed to return or make corrections. We 
shall refer to this as the form-to-mode (ЕМ) 
test. 

Experiment 3: Recall о} form. The booklets 
carried (on successive pages) each of the modal 
characters drawn as a line. U Ss were instructed 
to recall and draw the figure that originally ap- 
peared in that mode; non-U Ss were to draw the 
figure that had been previously paired with the 
given modal line. The instructions paralleled those 
of Experiment 2. The order of the modes was 
identical with that o£ the forms in the recall test of 
Experiment 2. We will call this the mode-to-form 
(M>F) test. 

Except where otherwise stated, the Ss were 
Swarthmore College students. There was an equal 
number of men and women in each variation. 


Scoring 


We will first describe the procedure of scoring 
the free recall reproductions. Each reproduction 
was first scored independently for form (Е) and 
mode (M). The method of scoring was liberal; a 
form or mode was scored correct if it was recog- 
nizable as one belonging to the series. Changes m 
orientation, size, or proportion of the forms, and 
similar changes of mode, were not taken into ac- 
count. The reproductions were scored by two 
judges; in case of disagreement they reached a 
joint decision. The task of scoring was relatively 
simple; disagreements occurred inírequently. We 
may note that certain highly stable and reproducible 
results, to be described subsequently, demonstrate 
that the procedure of scoring, despite an irreducible 
subjectivity, was quite adequate to the present 
purpose. 

In computing an S's total recall score we pro- 
ceeded as follows. Each correct reproduction of a 
form or of a mode received a score of 1. (Thus, 
the complete reproduction of a U stimulus received 
a score of 2.) If a form or mode was reproduced 
several times, it was scored as correct only once. 
If one of the reproductions was a correct pairing, 
it was recorded as correct at that point; if it was 
not correctly paired, it was scored as correct upon 
first recall. 

We also scored independently for accuracy of 
joint recall of form and mode, a measure that 
provides an index of the intimacy of connection 
between them. Here, we call attention to a termi- 
nological point. The joint recall of а and b in the 
nonunitary condition has always been called asso- 
ciative. Since we must leave open whether joint 
recall in the unitary condition involves the same 
process, We will employ a more neutral designation 
and will speak of correct or incorrect pairing. This 


more general term is not intended to describe the 
process in question; it is rather descriptive of the 
procedure for scoring jointness of recall. It cam 
of course, also be applied without prejudice to the 
nonunitary condition. 

The structure of the task permits the following 
systematic and exhaustive way of scoring ап 
ordering the data. 1. Form and mode correctly ге- 
produced and correctly paired. We designate these 
recalls as (ЕМ), the plus sign preceding the 
parenthesis denoting a correct pairing. For non- 
stimuli this category describes the correct repro- 
duction of a figure and its paired line, 2. Form 
and mode correctly reproduced but incorrectly 
These were instances in which a form was 
a mode that went with another form. 
We refer to these reproductions as —(ЕМ), the 
minus sign preceding the parenthesis denoting in- 
correct pairing. 3. Form alone correctly repro- 
duced. Here were included instances In which the 
mode was either omitted, or an extraserial mo i 
was introduced. Also belonging to this category 
were instances of a repeated reproduction of mode 
which had been previously scored. 4. Mode alone 
correctly reproduced. This category correspondec 
to the one just described under 3. | 

Scoring of the other recall tests was simpler. 
A reproduction (of form or mode, respectively) 
was first scored for correctness, whether or not it 


paired. 
joined to 


was correctly paired. It was also scored for асси" _ 


racy of pairing. We thus obtained scores for the 
total of correct recalls and of correct pairings ™ — 
Experiments 2 and 3. 


Results 


The Constitutive Effect 

Experiment 1. The free recall data of 
this experiment are summarized in Table + 
Since the scores of men and women were 
closely similar, they will be reported jointly: 

1. The total level of recall (or the геса 
of Г and М properties regardless of accu 
racy of pairing) was virtually identical И 
the U and non-U conditions. The mean 0 
total recalls (see column 5) was close (0 the 
50% level, a degree of difficulty highly d 
quate to the present purpose. soor : 
ranged from 7 to 14, non-U scores гапа” ( 
from 5 to 14. E 

2. Accuracy of joint recall was ^ A 
tially more frequent in the U condition 2%. 
Columns 1 and 2). It accounted for 76. on 
of all U recalls, for 49.2% of all non- -... «, 
calls. By the Mann-Whitney test the MU 
ence is significant at beyond р = 001. 
tests were two-tailed.) 


Е 
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3. Incorrect pairing was far more fre- 
quent among the nonunitary stimuli. There 
was а total of nine such instances in U, of 
37 in non-U; (р = .001). Two-thirds of 
the U Ss did not pair incorrectly, and only 
one S did so more than once. Not only did 
a far greater proportion of non-U Ss mis- 
match the a and b terms; they also did so 
more often than U Ss. 


It was stated earlier that repeated repro- 
ductions of a form or mode were not 
counted in the scoring. This decision had 
consequences for the scoring of incorrect 
pairings involving repeated reproductions 
of form or mode, which will now be de- 


scribed. 

(a) When an S paired incorrectly a form or 
mode which he had previously paired correctly, it 
was not scored as an incorrect pairing. If, for ex- 
ample, the S had reproduced Е» Ms, and it was 
preceded or followed by F» M, the latter was 
counted as an instance of correct pairing, and the 
former was scored simply as a reproduction of Е», 
and was included in Column 3. (b) Occasionally, 
two wrong pairings occurred involving a repetition 
of form or mode; for example, the reproduction 
of Е, Ms, and subsequently of Fs Mo. In that case 
the order of recall determined the scoring; the first 
wrong pairing was retained and the second one 
was altered to read Ез, and again entered in Col- 


umn 3. (c) If a wrong pairing, say Е, Ms, was 
followed or preceded by the isolated reproduction 
of Е, or Ms, the former was scored as an incorrect 
pairing, and the latter was dropped out of the 
scoring system. Thus, the absolute number of in- 
correct pairings was somewhat larger than appears 
in Table 2; however, an uncorrected procedure of 
scoring would not alter the finding described above. 
(4) As implied in (а), an accurate pairing was 
scored correct without exception. 

4. Recall of only one of the paired terms 
(of F or M alone) comprised 15.4% of all 


U recalls, 19.5% of all non-U recalls. 


5. In the case of the isolated recalls just 
referred to above, U stimuli produced a 
small but decided preponderance of form 
(F) recalls, and non-U stimuli showed a 
similar preponderance of M recalls. This 
finding reappears consistently in subsequent 
experiments (see p. 11). 

The predominance of recall of modes in 
the non-U condition probably reflects an 
actual difference of complexity between 
them and the forms. The necessity of pre- 
paring manageable U stimuli required the 
selection of the simplest modes; many of 
these were, moreover, highly familiar con- 
ventional symbols. Why, then, were not the 
modes also better recalled in the U condi- 


TABLE 1 


RECALL SCOR 


ES OF UNITARY AND NONUNITARY STIMULI 


Experiment 1: Free Recall Test 


2 3 4 5 
Correctly [Incorrectly Recall Recall Mean 
Paired Paired of of of 
Recall Recall Form Mode Total 
y +(FM) | = (ЕМ) Alone Alone Recall 
Unitary ранні 24 M 7.8 0.8 1.2 0.5 10.2 
% 76.2 7.4 11.9 4.5 
Nonunitary Stimuli 24 M 4.8 sd 0.8 1.1 9.8 
49.2 31.4 8.1 11.4 


Note.-Each fo mos 
and io den correctly paired; 


in Column 2. 
The percentage 
Paired forms and m 


lues represent the 
nodes in the unitary 


ode is treated as a unit of recall. Thus 
RN the mean number of pairings is one 
number of recall units in a given category divided by the total recall, Thus, correctly 


series comprised 76.2% of all items recalled. 
rcentage values were computed by dividing the total scores of the respective categories. 


the figure 7.8 in Column 1 stands for the mean number of forms 
half of this value, or 3.9. This is also the way to read the entries 


In this and in all succeeding deviate slightly from, and will be more exact than, those obtained by dividing the respective means. 


d by dividing the total score by N; these will therefore be more exact than the 
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tion? It is probable that form was the domi- 


nant aspect of the U stimulus (see also p. 


30). 

Experiments 2 and 3. These experiments 
employed two other methods of recall. As 
described earlier, we tested for recall of the 
mode given the continuous form (which we 
designate as F—M), and for recall of the 
form given the mode (or MF), both un- 
der U and non-U conditions. In the recall 
test of Experiment 2, both groups of Ss 
were given the forms drawn continuously, 
each appearing on the left of its page, cor- 
responding to the position of the learning 
stimuli. In order to preserve the constancy 
of conditions between the non-U groups of 
Experiments 2 and 3, the learning stimuli 
were in the latter case drawn with the 
modes at the left and the continuous forms 
at the right. At the time of test the dupli- 
cates of the modes were also shown at the 
left, to correspond to their position during 
learning. 

The results, which appear in Table 2, con- 
tain the frequencies of (a) correctly paired 


The results were markedly similar to 
those obtained with the free recall test of 
Experiment 1. 

1. The levels of recall of U and non-U 
stimuli were very similar in Experiments 2 
and 3. Again, mean recall was approxi- 
mately at the 50% level.’ 

2. Accuracy of joint recall again was sig- 
nificantly higher in each of the U conditions. 
In Experiment 2, the percentage of cor- 
rectly paired U recalls was 77.8, of non-U 
recalls 40.7% (Р = 001). In Experiment 3, 
the corresponding percentages Were 74.6 
and 48.5 (p = .01). Differences in incor- 
rect pairing, of course, Were identical TM 
those just described. T hese results are al 
the more notable when one considers that 
the test stimuli were identical with those 0 
the learning series in the non-U condition, 
while differing from those of the U series. 
3. The results obtained in Experiments 2 
and 3 were closely similar. That is to 58); 
F and M terms evoked each other with 


equal frequency. 


recalls and (b) of items correctly recalled 7 The scores of Table 2 refer, of course, to recall 
but wrongly paired. of forms or modes. 
TABLE 2 
RECALL SCORES OF UNITARY AND NONUNITARY STIMULI 
Experiments 2 and 3: Paired Associate Recall Test 
1 2 3 
Correctly | Incorrectly Mean Е 
iti N Paired Paired Tota 
— Recall Recall Recall 
Experiment 2: Recall of Mode (Е—>М) 
Unitary Stimuli 12 M 4.1 1.2 5.3 
96 77.8 22.2 Р 
Nonunitary Stimuli 12 M 2.0 2.9 4. 
% 40.7 59.3 
Experiment 3: Recall of Form (M——F) 5 
Unitary Stimuli 12 M 3.7 1.3 4. 
% 74.6 25.4 a 
Nonunitary Stimuli 12 M 2.1 2.8 7 
% 48.5 51.5 
Experiment 3a: Recall of Form (М——Е) , 
Nonunitary Stimuli 12 M 1.9 1.3 3. 
% 60.5 39.5 
of cout?" 


Note. 
identical. 


—In this table the number of items recalled in a given category and the numb: 


er of pairings in that category їз, 
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This finding needs to be qualified at one 
point. We repeated the non-U variation of 
Experiment 3, this time placing the modes 
at the right, both during learning and recall ; 
- we refer to this condition as Experiment За. 
Thus, at the time of test the S had before 
him the right-hand member of a pair and 
was attempting to recall the left-hand mem- 
ber. The results, which will be found in 
Table 2, show that this procedure depressed 
recall significantly, to 57% of that obtained 
in the corresponding non-U variation of 
Experiment 3 (р = .005). Accuracy of 
joint recall was depressed absolutely but not 
relatively; the decrease did not reach sta- 
tistical significance. е 
Since the conditions of learning were in 
all respects identical with those prevailing 
in the preceding experiments, the drop in 
recall of forms was a direct consequence of 
the method of test, and demonstrates that 
association of the non-U stimuli was polar- 
ized. Although the pairs were exposed 
simultaneously, and though the instructions 
did not weight the members differentially, 
they were psychologically successive, and 
were mastered in a left-right direction. 
There is no analogue to this asymmetry 1n 
the U condition. This result thus describes 
а further difference between the contrasted 
Conditions: U-pairings were not polarized, 
™ contrast to non-U pairings. 
_4. Experiments 2 and 3 did not produce 
higher levels of correct pairing than did Ex- 
periment 1. This result is somewhat sur- 
prising when one considers that recall was 
of the paired-associate form, in contrast to 
the free recall procedure of Experiment 1. 
It would have been reasonable to expect that 
some of the isolated recalls of Experiment 1 
Would be converted into correct pairings in 
the present variations. A hint of what oc- 
curred is provided by {Бей orsi Hires 
quency of incorrect pairings, both in the U 
and поп-О variations. Apparently, aided 
recall introduced its Own constraints; a 5 
had to respond correctly at a given point, or 
be penalized, whereas the free recall proce- 
| dure permitted him to repeat any combina- 
tion of form and mode, and thus to correct 

his earlier responses. That the levels of total 
| recall were closely similar to those of Ex- 


| 
LO 


periment 1 is not surprising, since the con- 
ditions of learning were not altered. 


Confirmatory Experiments 


Experiment 4: Combined U and non-U 
Conditions. The preceding experiments 
compared homogeneous U and non-U series. 
The following variation combines these 
within the same experiment, a procedure 
that permits each S to serve as his own 
control. 

From the respective series of Experiment 1 we 
derived two new series, each consisting of 5 unitary 
and 5 dual stimuli, There were two groups of Ss. 
The stimuli that were given in U form to one 
group appeared in non-U form for the other group, 
and conversely. In assigning stimuli to the two 
categories, we chose sets equated for difficulty on 
the basis of the data of Experiment 1. (The U 
stimuli for group 1 were: 1, 4, 6, 7, and 10.) 

We also rotated the order of presentation of the 
stimuli, so that each stimulus would appear in each 
of the 10 positions. The order of presentation of 
U and non-U stimuli was, for the first S, 
ABBABAABBA, This order was shifted one step 
for each succeeding S. 

The instructions described in detail the two kinds 
of stimuli, with the help of an illustrative example 
of each. Again, the Ss were set for a test of free 
recall. They were cautioned explicitly to "make 
every effort to pay equally close attention to each 
card." 

Testing was with the method of free recall. Ss 
were urged to reproduce part of each stimulus 
when they could not recall it entirely and to guess 
when not certain. The procedure was in all re- 
spects identical with that of Experiment Ц 
There were 20 Ss, 10 in each group, equally 
divided among men and women. 

a. Results. The overall comparisons be- 
tween the U and non-U stimuli, obtained by 


pooling the results of both groups, appear in 
Table 3. к 

1. The difficulty of the tasks, measured in 
terms of recall of content, was closely simi- 
lar to those of Experiment т. 

2. U and non-U stimuli produced virtu- 
ally identical total recall in one series; in the 
other series the U stimuli showed higher 
recall. 

3. Under the present conditions, the dif- 
ferences between U and non-U stimuli in 
accuracy of joint recall were considerably 
accentuated. Correctly paired recalls ac- 
counted for 88.7% of all U recalls, for 
29.3% of all non-U recalls (р = .001). 


10 


4. The differences in inaccurate pairing 
were even sharper. There was only one such 
instance in U, but inaccurate pairings com- 
prised 37.8% of all non-U recalls. 

5. Sixteen out of 20 Ss were superior in 
pairing correctly the U stimuli; three 55 
performed equivalently ; and there was one 
difference in favor of the non-U stimuli. 
(Of the Ss who went contrary to the pre- 
dominant trend, three recalled very poorly 
throughout, and one obtained a nearly per- 
fect recall.) 

6. Of the total of 10 stimuli, 9 produced 
more correct pairings in U; one stimulus 
failed to show a difference. 

7. The reproductions showed a remark- 
able lack of confusion between the two sets 
of stimuli. (a) There was a total of 16 in- 
stances of incorrect pairing. Only one of 
these involved the combining of a form 
from one set with a mode from the other 
set; all other mispairings occurred within 
the same set. (b) Further, the U or non-U 
character of the correctly paired stimuli was 
almost faultlessly retained. In only one in- 
stance were the form and mode of a U 
stimulus recalled as separated; in no case 
was a pair of non-U stimuli reproduced in 
a joined manner. These results lend support 
to the conclusion that the boundaries be- 
tween the two sets of stimuli were strong. 

Under the present conditions, the differ- 
ences between the stimulus conditions in- 
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creased sharply, when compared with Ex- 
periment l. It would appear that, in the 
competition for recall, U stimuli had a de- 
cisive advantage. This finding might be 
questioned if the Ss had deliberately favored 
{һе С stimuli during learning. As men- 
tioned earlier, the instructions attempted to 
guard against this contingency. As a further 
the Ss were questioned about their 
procedure at the conclusion of the 
Twelve Ss reported paying 
both kinds of stimuli, 5 Ss 
13 reported that they had 
studied the non-U stimuli more carefully. 


According to these reports, there was no 
the procedures the Ss 


check. 
learning 
experiment. 
equal attention to 
favored the U, anc 


particular bias in 
followed. | 

It may be pertinent to mention bs 
greater number of Ss expressed the VIEW 
that U stimuli were easier because simpler. 
“There was only one thing to remember 1 
the single figures, two things to remember 
in the double figures"; “The double ones 
sort of split my attention"; “There was 
more to remember with the double figure? ^ 
two separate objects." 

The results thus confirm the main 
ings previously reported concerning the 
ferences between the U and non-U c0 
tions. 

Experiment 5: Randomized Pairing of 
Form and Mode. In the preparation С 
stimuli for the preceding experiments м 


that the 


find- 
1 f- 
ndi- 


TABLE 3 


RECALL SCORES OF UNITARY AND NONUNITARY 
Experiment 4: Combined Unitary and Nonunitary Condition — 


STIMULI 
Free Recall Test 


H 


1 2 3 4 5 
Correctly |Incorrectly| Recall Recall меа 
Paired Paired of of Total 
N Recall Recall Form Mode 5 
+(FM) | —(ЕМ) Alone Alone Re 
3 
Unitary Stimuli 20 M 4.7 0.1 0.3 0.3 5. 
| 
% 88.7 0.9 5.1 4.7 C 
; — 4. 
Nonunitary Stimuli 20 M 1.2 1.6 0.5 0.8 NN. 
% 29.3 37.8 12.2 20.7 
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sought to avoid combinations of forms and 
modes that might be similar or outstand- 
ingly fitting. Since there was no assurance 
that we excluded such selectiveness com- 
pletely, the following variation was done as 
a check upon this possibility. 


'The forms and modes of the preceding experi- 
ments were now randomly paired, We numbered 
the modes from 1 to 10, drew these numbers and 
assigned each to the forms, in the order they had 
in the earlier learning series. We thus obtained a 
new pairing of F and M properties. Three of the 
original stimuli (Nos. 5, 7, and 8) were line forms; 
these we changed into closed areal forms (in 
preparation for a further variation to be described 
in a subsequent section). These stimuli, which will 
be employed frequently in subsequent experiments, 
are reproduced in Figure 3. 

The procedure, instructions, and methods of scor- 
ing were identical with those of Experiment 1. The 
test was one of free recall. There were 24 Ss in 
each variation, 12 men and 12 women. The sum- 
mary data appear in Table 4. 


1. Recall of content, or of the forms and 
modes, was again virtually identical in the 
U and non-U conditions. The mean levels 
of recall continued to be similar to those of 
Experiment 1 (see Table 1). 

2. U stimuli produced far more correct 
pairings than non-U stimuli (65.596 and 
38.1%, respectively, of all recalls: р = 
.001). Inaccurate U pairings were far less 
frequent than inaccurate non-U pairings (р 
= .001). Аз in Experiment 1, U Ss either 
did not mismatch, or did so no more than 
once; mismatching occurred among more 
non-U Ss, and with higher frequency. 

3. Also in agreement with the results of 
Experiment 1 is the finding that isolated 
recalls of U forms were more frequent than 
of U modes, but that in the non-U series 
there was a predominance of recall of 
modes. 


We have thus confirmed the previous 
findings, and demonstrated that the effect 
persists under modified stimulus conditions. 


Index of the Constitutive Effect 


Since the principal difference between U 
and non-U stimuli concerns accuracy of 
joint recall, it would be helpful to express 
the effect in such а manner а$ to permit 


comparison among different. experimental 


conditions. One index that recommends it- 
self for this purpose is the ratio of correct 
to incorrect pairings. In order to avoid com- 
plications due to zero scores, we adopted as 
the index the difference between correct and 
incorrect pairings divided by their sum. 
This index varies between +1. When in- 
correct pairings are zero, the value is 1; 
when correct pairings are zero the value is 
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Fic. 3. Unitary and nonunitary stimulus series: 
Experiment 5. 
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TABLE 4 


RECALL SCORES oF UNITARY AND NONUNITARY STIMULI 


Experiment 5: Free Recall Test 
1 2 3 4 5 
Correctly Incorrectly} Recall Recall Mean 
Paired Paired of of of 
N Recall Recall Form Mode Total 
+(FM) —(FM) Alone Alone Recall 
Unitary Stimuli 24 M 6.3 1.23 1.6 0.4 9.7 
% 65.5 13.8 16.8 3.9 
Nonunitary Stimuli 24 M ый 3.8 1.0 1.8 9.6 


—1; and the index takes on the value zero 
when both kinds of pairing occur with equal 
frequency. 

We accordingly computed the values of 
the index for each S. Table 5 summarized 
the data for Experiments 1 to 5, and of a 
number of subsequent experiments employ- 
ing the method of free recall. We computed 
the median values of the index in each con- 
dition, the proportion of 55 with exclusively 


correct pairings, and the proportion of Ss 
with predominantly incorrect pairings. ^P 
obtaining these values we grouped together 
Experiments 1 and 5, and also Experiments 
2 and 3. 

The U and non-U series differed con” 
sistently for all selected values of the index; 
The U medians of Experiments 1 to 
ranged from 1.00 to 0.7, the non-U medians 
from 0 to 0.3. The proportions of +1 values 


TABLE 5 


INDEX oF PAIRING ACCURACY 


Correct Pairings — 


Incorrect Pairin 3 


Correct Pairings + Incorrect Pairings 


Conditions 


Experiments 1 and 5 combined: 

Unitary Stimuli 

Nonunitary Stimuli 
Experiments 2 and 3 combined: 

Unitary Stimuli 

Nonunitary Stimuli 
Experiment 4 

Unitary Stimuli* 

Nonunitary Stimuli* 
Experiment 7 

Unitary Stimuli 

Nonunitary Stimuli 
Experiment 11 
Experiment 12 

“Coherent” Stimuli* 

“Not coherent” Stimuli* 


+1 Scores Negative Scores 
Md. (in %) (in %) 
ME mE ج‎ Е 
1.0 54 2 
0.3 21 23 
0.7 42 13 
0.0 0 25 
1.0 95 0 
0.0 25 45 
0.6 94 0 
0.1 50 25 
0.0 25 21 
0.9 50 4 
0.1 21 33 


» Based оп 5 stimuli. 
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and of negative values exhibited pronounced 
differences in the same direction. 

The following analysis provides further 
evidence of the differences in pairing be- 
tween and within units. When the method 
of test was that of free recall, there were 
instances of the isolated recall of both the 
form and mode of the same stimulus, or of 
the mispairing of either with another mem- 
ber of the series. The procedure of testing 
leaves the S free to pair correctly at some 
later point those items which he had previ- 
ously recalled but not paired, or which he 
had mispaired. We therefore ask: When 
the form and mode of a given stimulus were 
recalled separately, what was the likelihood 
that they would also be eventually correctly 
paired ? 

Table 6, which contains the relevant data 
for Experiments 1 and 5, demonstrates а 
marked and consistent difference between U 
and non-U stimuli in this respect. There 
was a total of 10 instances in the U varia- 
tion of isolated recall of form and mode of 
a given stimulus without eventual correct 
pairing of the same terms; in the non-U 
variations there were 42 such instances. 
That is to say, when both form and mode 
of a unitary stimulus were recalled, they 
were as а rule also correctly reunited ; this 
was far less often the case for nonunitary 


stimuli. 
Control Experiments: Recognizability and 
Intraserial Interference 


We have inferred that the preceding re- 
sults were directly a function of the respec 
tive stimulus relations—of the constitutive 
relation and of the relation of duality be- 
tween paired heterogeneous contents. Be- 
fore one can safely draw this conclusion, И 
is necessary to determine whether the 
stimulus conditions differed also in other 
ts, and whether the latter might be 
ble for the obtained results. The 
experiment and the one following 
d with this question. 

6: Recognizability of Unitary 
Stimuli. It is by no means 
he U and non-U stimuli 
t or identifiable. The 


respects, 
respons} 
present 
are concerne 

Experiment 
and Nonunitary 
established that th 
were equally distinc 


rationale of this investigation obviously re- 
quires that the experimental effect should 
not be a function of differences of this kind. 
The object of this variation, accordingly, 
was to compare the identifiability of the 
forms and modes in the two series. 

a, Procedure. As the criterion of distinctness or 
clarity we chose the recognizability of the forms 
and modes. Except for the final test, the stimuli, 
instructions, and other conditions of learning were 
identical with those of Experiment 5. As before, 
the Ss were instructed to pair form and mode in 
anticipation for a test of free recall. However, the 
test that followed, and which will now be described, 
was one of recognition. 

The test booklets carried exact reproductions of 
each form and mode, one to a page; each appeared 
in the center of the page. The forms were all 
drawn continuously, and the modes were in linear 
array. Included in the test were 10 new forms and 
10 new modes. The task was to check a “yes” or 
“no” printed on each page, indicating whether or 
not the given stimulus had been part of the learn- 
ing series. The Ss were informed that new stimuli 
would be interspersed in the test series, Again, the 
test of recognition came 3 min. following the pre- 
sentation of the learning series. 

The sequence of stimuli was as follows. The old 
forms and modes were each divided into 2 blocks 
of 5. To each of these 4 sets we added a set of 5 
new forms or modes. Thus, the sequence consisted 
of 4 blocks of 10 stimuli, each containing 5 old and 
5 new forms or modes. The order of the blocks 
was varied with succeeding Ss; there was thus a 
total of 4 orders, Also, the order of the critical 
stimuli within each block was rotated from S to 5; 
the noncritical stimuli retained the same position 
within a block. 

The Ss were Princeton University undergradu- 
ates, 16 in each group. 

b. Results. The percentages of correct 
recognitions of the critical stimuli, and of 


TABLE 6 


FREQUENCY OF FAILURES TO PAIR RECALLED FORM 
AND Море BELONGING TO THE SAME STIMULUS 


Unitary 
Stimuli 


Nonunitary 
Stimuli 


F 
Experiment 1 4 
Experiment 5 6 
Experiment 7 53 
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TABLE 7 
RECOGNITION OF FORMS AND MODES 
(in %) 


Experiment 6 


Critical Noncritical 
Stimuli Stimuli 
Unitary Stimuli 
Form 86 91 
Моде 69 67 
Nonunitary Stimuli 
Form 90 94 
Mode 83 92 


correct nonrecognitions of the noncritical 
stimuli, appear in Table 7. 


(a) The level of recognition of the crit- 
ical stimuli was, with one exception to be 
noted below, quite high. (b) Recognition of 
critical forms was superior to recognition of 
critical modes. By the sign test, this differ- 
ence was significant in the U series (р < 
01), not in the non-U series. (c) Critical 
forms were equally recognizable, whether 
they were unitary or not. (d) However, the 
recognition of non-U modes was higher 
than that of U modes (р = .02). (e) Non- 
recognition of the noncritical stimuli was, 
with one exception, slightly but consistently 
superior to the recognition of critical 
stimuli. 

The test stimuli were most similar to 
those of the non-U series. This circum- 
stance fails, however, to account for the 
difference in recognition of unitary and non- 
unitary modes, when one considers that the 
forms were equally recognizable in the two 
series. We conclude that there was a tend- 
ency for form to be dominant over mode in 
the U series. This is not surprising in the 
light of the distinctive functions of form 
and mode in the U stimulus, and of their 
comparative similarity of function in the 
non-U stimulus.* 


8 Consistent with the present data is the finding 
reported earlier of a slight but regular superiority 
of form over mode recall in the U series. The re- 
sults are not, however, consistent with the corre- 
sponding finding that non-U modes were somewhat 
better recalled than forms. 


While it is difficult to state precisely the 
relation between these data of recognition 
and the earlier data of recall, we may draw 
the following limited conclusion. Since the 
level of recognition sets limits to the pair- 
ings that can occur, differences of stimulus 
clarity did not favor the U series. If any- 
thing, the reverse was more likely the case; 
10 the extent that this factor played a role 
it could only favor pairing in the non-U 
series.” 

As a further demonstration of the same 
point, we may cite the following analysis. 
From the recognition data we obtained a 
measure of “synthetic pairing," namely, of 
the incidence of recognition of form and 
mode belonging to the same unit ог to the 
same pair. This measure presumably sets a 
limit to the pairings that can occur. There 
were 95 such synthetic pairings in the U- 
group, and 122 in the non-U group (p = 
02). These comprised 76% of all recognt 
tions in the U-series, and 90% of all recog- 
nitions in the non-U series. Again, we have 
to conclude that the conditions of registra- 
tion tended to favor the non-U series; if 50; 
the recall data of the preceding experiments 
tend to underestimate the effect of the co 
stitutive relation on pairing. 


Experiment 7: The Role of Intraserial 
Interference. It might be claimed that the 
stimulus series were not equally homogene- 
ous. The non-U series consisted of forms 
all of which had continuous boundary lines, 
and of other forms all of which appeared “ 
line aggregates. This circumstance п 
make them more vulnerable to intraser!4 
interference than the apparently more inc! 
vidualized U stimuli. 

In the variation to be now reported, the 
Ss were afforded an opportunity to becom 
familiar with the individual forms ап 
modes prior to the main task. The proce 
dure of familiarization raised notably the 
distinctness and recallability of the term* 


9 One must caution, however, against inferring 
too directly from recognition to recall. When al 
stimuli (of each series) were ranked, respectiV nd 
for ease of recognition in the present variation ola’ 
for ease of pairing (in Experiment 5), the corf 
tions were low and lacked significance. 
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before the main task was introduced. This 
step could only have the effect of reducing 
the levels of subsequent intraserial inter- 
ference, and óf equalizing the levels in the 
respective series. We reasoned that the ex- 
perimental effect should be similarly re- 
duced if it were a function of serial inter- 
ference. 


a. Procedure. The following was the procedure 
of familiarization. The forms and modes of Ex- 
periment 5 were drawn on separate cards, and in 
the positions they previously occupied—the forms 
at the left, the modes at the right. 

One half of the Ss were shown first the set of 
10 forms, in succession. The period of exposure 
of each form was 4 sec. The series was shown 3 
times; before cach trial the order was changed by 
shuffling the series. Ss were instructed to study 
the forms so as to be able to reproduce them cor- 
rectly. The third trial was followed by a test of 
free recall, Thereupon, the Ss were shown the 
series of 10 modes under identical conditions, and 
again recall was tested. The other half of the Ss 
were first shown the modes, and then the forms. 

After a lapse of 3 min. we proceeded to a repe- 
tition of Experiment 5, Ss were assigned to the 
unitary or nonunitary groups on the basis of the 
preceding tests; the two groups were closely 
matched. The procedure of Experiment 5 was 
maintained without change, except that it was pos- 
sible to point out during the instructions (with the 
help of an illustrative stimulus) the relevance of 
the preceding familiarization to the task. 

b. Results. The procedure of familiariza- 
tion produced a high level of recall of forms 
and modes ( Table 8, first column). Further, 
the level of recall was virtually identical for 
the two groups of Ss. There was а slight 
but consistent superiority in the recall of 


modes; for the entire group of 32 Ss, the 
superiority of modes was, by the sign test, 
significant at beyond .01. 

The data concerning pairing are also sum- 

marized in Table 8. 
‚ 1. The chief finding is the superiority of 
joint recall in the U condition. Ss in the 
latter group showed a mean of 5.3 correct 
pairings; the mean of non-U pairings was 
2.3 (p < .001). Each unitary stimulus pro- 
duced more correct pairings than its non- 
unitary counterpart. 

2. Under the present conditions the U 
and non-U series did not produce signifi- 
cantly different frequencies of incorrect 
pairings; the respective means were 1.4 and 
1.9. 

Familiarization significantly raised the 
frequency of correct U pairings, compared 
with the level obtained in Experiment 5 (р 
< .01), but did not significantly alter the 
level of non-U pairings. Further analysis 
demonstrates, however, that familiarization 
did not alter the relative frequencies of ac- 
curate and inaccurate pairings in either 
series, in comparison with Experiment 5. In 
order to allow for the differences in abso- 
lute level of recall, we took the total of ac- 
curate and inaccurate pairings, and com- 
puted the proportion of each to the total. 
The results appear in Table 9. The ratios 
are substantially similar to those of Experi- 
ment 5. The differences in absolute level 
tell us that familiarization increased the at- 


MEAN RECALL Scores or UNITARY AN 


TABLE 8 


р NoNUNITARY STIMULI FOLLOWING FAMILIARIZATION 
Experiment 7: Free Recall Test 


1 3 4 5 6 
Mean of In- Mean of 
Total Correctly correctly Recall Recall Total 
N Recall Paired Paired of Form of Mode Recall 
before Recall Recall Alone Alone after 
Pairing + (FM) —(FM) Pairing 
Unitary Stimuli 16 18.0 5.3 1.4 2.4 2.6 18.3 
Monanitary Stimuli 16 17.8 2.3 1.9 4.1 4.8 17.1 


Note.-The mean 
f Column 6 were о 


values in Co 


btained durin 


п 1 are the recall scores obtained following familiarization and before pairing. The values 


lum; 
g the final test of free recall. 
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TABLE 9 


PROPORTION OF CORRECT PAIRINGS TO 
Torar PAIRINGS 


Experiments 7 and 5 


| 
Correctly Incorrectly 
Paired Paired 
| Recall Recall 
+(FM) —(FM) 
Experiment 7 
Unitary Stimuli 78 22 
Nonunitary Stimuli 54 46 
Experiment 5 | 
Unitary Stimuli 83 17 
Nonunitary Stimuli 53 AT 


tempts at pairings, but not the proportions 
of the kinds of pairing. 

3. It follows that isolated recalls must be 
more frequent in the non-U series. This 
was the case; the mean of isolated non-U 
recalls was 8.9, of isolated U recalls, 4.9. 

4. The index of pairing accuracy revealed 
a clear-cut difference between the two ex- 
perimental conditions (see Table 5). 


5. Finally, we again found that when 
form and mode of the same stimulus (or 
pair) were recalled, they were also correctly 
paired with far higher frequency when uni- 
tary. Of all recalled forms and modes be- 
longing to the same stimulus (or pair), 6296 
Were correctly paired by the U group; this 


value dropped to 30% in the non-U group 
(Table 6).19 


6. The total mean recall, regardless of 
pairing, was 18.3 for the unitary series, and 
17.1 for the nonunitary series; this differ- 
ence was not significant (see Table 8, last 
column), | 

These results were obtained after a high 
degree of familiarization with the contents 
of the series. There can be little doubt that 
this procedure, which equalized the identifi- 
ability of the stimuli for the two groups, 
also reduced and tended to equalize inter- 


19 Table 6 shows that failures to pair were fre- 
quent in this condition, in comparison with earlier 
variations. This was directly a consequence of the 
procedure of familiarization. 


ference during subsequent pairing. Had the | 


experimental effect been a function of inter- 
ference, it should have been markedly ге- 
duced under these conditions. Actually, the 
effect became more pronounced. The ratio 
of correct U pairings to correct non-U patr- 
ings was 2.3; in the corresponding variation 
of Experiment 5, the ratio was 1.7. . 

We conclude that the experimental effect 
was not a function of differences of intra- 
serial interference. Further, it seems to US 
that this variation, which eliminated E 
confounding due to the concurrent tasks о 
mastering the terms and pairing them, pro 
vides a more accurate approximation of the 
experimental effect, which the earlier results 
underestimated. 

These findings also confirm t er 
sions of the preceding variation. Equaliz 
the distinctness or identifiability of the st! р 
uli had the effect of accentuating the diffe 
ence between the conditions." 


he conclu- 


The Scope of the Unitary Effect 


" : vestiges 
In this section we continue the inv siye 
tion by varying further the conditions qe 
learning and recall, in order to establish 
scope of the unitary effect. 
р ту : based 


The results so far reported wer г 
entirely on the data of recall. In men 
investigation an obvious and important a 
tinction is that between the presence ОГ! 
tention of an association (or, 
erally, а pairing) and its recall, since а f д 
tional connection may be present but hil 
available. We can only conclude from M 
has preceded that unity was a conditi A 
favoring recall; it remains to be звона 
whether the effect extended to the form 


tion proper of pairs. 


n =.4 " 

11 Why did familiarization raise the absolt 
level of correct U pairings, and not of non- Psy? 
ings (compared with the levels of Experiment ari’ 
We interpret these results in the light of. E 
ment 6. There we saw that the recognizability, wis 
presumably the perceptibility, of the U modes the 
comparatively low. Familiarization increase = 
distinctness of U modes, and thus raised jon је 
call. But the non-U series, being more рег Ер za 
could profit less from the procedure of familia 
tion. 
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As a first step in this direction we em- 
ployed the test of matching. This procedure 
reduces considerably the burden placed 
upon the recall of contents. When the A 
and B terms are given at the time of test, 
the sole task of the S, once he has recog- 
nized them, is that of recalling the pairing 
they had initially. The procedure of match- 
ing thus provides a more direct test of the 
recall of an association, or of pairing. 

Experiment 8: The Procedure of Match- 
ing—Intentional Learning. Experiment 5 
was repeated with new groups of Ss. The 
experiments were now done in groups rang- 
ing in size from 6 to 10. 

The stimuli were projected on a screen with 
standard size lantern slides. The projected size of 
the images was controlled, by setting the width of 
stimulus 4 at 12 in. Ss sat at a distance of 6 to 12 
ft. from the screen. They were placed as close to 
the center as possible, their distance from the 
center of the screen not exceeding 35° — 40°. (As 
a further precaution, the Ss were asked, at the 
conclusion of the experiment, to state in writing 
whether they had been able to see the stimuli dis- 
tinctly; all answered in the affirmative.) 

The other conditions of learning were unchanged. 
Learning was intentional; аз before, the instruc- 
tions set the Ss to anticipate a test of free recall. 
An illustrative card was now introduced during 
the instructions, containing а unitary (or non- 
unitary) stimulus, in order to insure that the task 
would be understood (and also to make possible a 


comparison with a subsequent variation to be de- 


scribed below, which necessitated the illustrative 
1 of 4 sec. was intro- 


stimulus), Also, an interval i 
duced between the exposures of successive stimuli, 
(again for the purpose of comparison with a sub- 
sequent variation that required this procedure). 

A side task of anagram problems immediately 
followed the presentation of stimuli, After 14 min., 
booklets containing the forms, drawn in continuous 
outline, were distributed; each form appeared on a 
separate page, The order of the forms was varied 
from $ to 5. A matching sheet containing the 10 
modes was distributed at the same time, each 
drawn as a straight line. The modes were placed 
in a random sequence (constant for all Ss), and 
were numbered in order from 1 to 10, The instruc- 
tions for the matching test were then read. The Ss 
were directed to turn to the first form in the book- 
let, to select from the matching sheet the mode that 
went with it previously, and to enter its number 
next to the form. They were to guess when un- 
certain, and they were free to select a given mode 
more than once, but they were not to return to a 
page once they had passed it. А 

When this test was completed, the matching 


sheets were collected. Thereupon, the Ss were given 


a modified test of recognition. They were asked to 


TABLE 10 
RECALL ScorEs or UNITARY AND NONUNITARY 
STIMULI 
Experiments 8 and 9: Matching Test 
Unitary Nonunitary 
stimuli stimuli 
Mean Mean 
N | Correct | N | Correct 
Matchings Matchings 
Experiment 8: 
Intentional 20 6.1 19 3,2 
Experiment 9: 
Incidental 11 4.1 15 Ert 


inspect again the forms in their booklet, and to 
indicate whether they recognized each as one that 
had appeared in the original series by writing “yes” 
ог "no" next to it, Ss were warned not to alter 
their earlier matching responses; this they were in 
any case unlikely to do, since they were at this 
point without the matching sheets. 

The Ss were women students at Bryn Mawr 
College, 20 in the U variation and 19 in the non-U 
variation. 

Unitary stimuli produced clearly superior 
matchings, as Table 10 shows. The mean of 
correct U matchings was 6.1; the mean of 
non-U matchings was 3.2 (р = .0002). 

Те would have been instructive to compare 
the levels of matching with the levels of 
correctly paired recall in the earlier experi- 
ments. This we are unable to do, since the 
present procedure involved a change in the 
mode of exposure, and the introduction of 
an interval between exposures. The means 
of correct matchings are virtually identical 
with the means of correctly paired recalls in 
Experiment 5, but for the reasons stated we 
can draw no inference from this finding. 

The data concerning the recognition of 
forms, and their relation to accuracy of 
matching, appear in Table 11. Failures to 
recognize the forms were infrequent, almost 
equally so in both variations, the propor- 
tions of such occurrences being 7.5% іп О, 
and 10.0% in non-U. Although the pro- 


12 Despite the small number of failures to recog- 
nize, they were not randomly distributed among the 
stimuli; certain forms (eg, 1,5,6) were always 
recognized, by both groups; others (eg, 9,10) 
produced relatively high failures to recognize in 
both groups. й 


18 


cedure of testing suffered from certain 
shortcomings—it was preceded by a test of 
recall, and it lacked noncritical items—the 
levels of recognition were closely similar to 
those obtained in Experiment 6 (see Table 
7). We did not test for recognition of 
modes, but it is reasonable to assume that 
these levels were again similar to those of 
Experiment 6. The results just described 
highlight the fact that the U and non-U 
conditions differed markedly in matching at 
the same time that they were virtually iden- 
tical with respect to recognition of forms. 
When unitary forms were recognized, 
they produced correct matches in 63.8% of 
the cases; correctly recognized nonunitary 
forms produced correct matches in only 
31.6% of the cases. Thus, recognition of U 
forms went with correct matching in the 
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majority of cases, and recognition of non- 
unitary forms went with incorrect matching 
in the majority of cases. 

The present results demonstrate that the 
superiority of the unitary condition persists 
when the burden of item recall is reduced 
or eliminated. 

All of the preceding experiments were 
done under conditions of intentional learn- 
ing. It would be possible to maintain that 
the differences obtained between unitary and 
nonunitary stimuli were а function of 
learned ways handling the respective 


tasks. If so, a procedure of incidental learn- 


of 


of paradoxical instances 
recognition and correct 


might be accounted for by 
s was small. 


13 There were а number 
of reported failure of 


matching. In part they mi 
guessing. The number of instance 


TABLE 11 


RELATION or RECO 


GNITION TO ACCURACY OF MATCHING 
Experiments 8 and 9 


Experiment 8: Intentional Learning 
| 
| Unitary Stimuli (V = 20) Nonunitary Stimuli (М = 19) 
Correct | Incorrect Correct | Incorrect 
Matching | Matching Total Matching | Matching Total 
Recognition в |1 | 6 185 54 117 171 
% 63.8 | 36.2 31.6 68.4 pe 
Failure of Recognition F 3 | 12 15 6 13 19 
% 20.0 | 80.0 31.6 68.4 
Experiment 9: Incidental Learning 
Unitary Stimuli (N = 11) Nonunitary Stimuli (N = 15) 
Correct | Incorrect Correct | Incorrect А | 
Matching | Matching | Total Matching | Matching Tota 
i | 8 
Recognition F ш | 50 91 25 105 13 
% 45.1 55.0 19.2 808 | 
Failure of Recognition Е 5 13 18 1 19 20 
% 27.8 72.2 5.0 95.0 
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ing should reduce the difference between 
them. Accordingly, we repeated the preced- 
ing experiment under incidental conditions. 


Experiment 9: The Procedure of Match- 
ing—Incidental Learning. The experiments 
were done in groups of 4 to 8. As in Ex- 
periment 6, the stimuli were projected on a 
screen in a darkened room. 

Тре task was that of judging the relative attrac- 
tiveness of the forms and modes. In the nonunitary 
condition the Ss stated whether the left (figure) or 
the right (mode) stimulus was the more attractive ; 
in the U condition they indicated the relative at- 
tractiveness of the overall form and of its modes. 
An illustrative card was shown, as described in 
Experiment 8, in each condition. The Ss were en- 
couraged to judge spontaneously; the judgments 
were recorded on a rating sheet. A 4 sec. interval 
was introduced between exposures, during which 
the Ss recorded their judgments. Special care was 
taken to insure that the Ss would attend to the 
stimuli during the entire period of exposure, which 
was also 4 sec. . 

Recall was tested, after an interval of 3 min, 
by the method of matching. The procedure of test- 
ing was in every detail identical with that employed 


in Experiment 8. 
The Ss were 26 women students at Bryn 


Mawr College. 

The results, which appear in Table 10, 
show that the incidental procedure markedly 
depressed the level of correct matching; in- 
deed, matching of the non-U stimuli was 
not significantly different from chance. The 
differences between the incidental and in- 
tentional matchings, for the U and non-U 
Stimuli, were significant, respectively, at р 
= 03, 

At the same time the U stimuli were 
matched with a considerable degree of accu- 
гасу, one that exceeded significantly the 
level of non-U matchings. The mean num- 
ber of correct U matches was 4.1, of correct 
non-U matches, 1.7 (р = .01). Indeed, the 
incidental matching of U stimuli was 
slightly superior to the intentional matching 
of non-U stimuli, a further indication of the 
effectiveness of the U condition. We con- 
it of interest that the U group con- 
tinued to match with a substantial degree of 
accuracy under the incidental condition, 

We again tested for the recognition of 
forms in the manner described for the pre- 
ceding variation. Recognition of forms was 


sider 


somewhat, but not strongly, below the levels 
obtained with the intentional procedure (see 
Table 11). In this situation, we find that in 
the majority of cases recognition of form 
went with failure to match correctly. This 
finding is of particular interest with refer- 
ence to the U stimuli (see pp. 36-39). A 
reference to Table 11a shows that the dis- 
parity with the synthetic measures of Ex- 
periment 6 was now quite extreme. This 
result cannot, as we saw, be traced more 
than very partially to lower recognition. We 
have to conclude that matching was to a 
considerable degree independent of recog- 
nition. 

The comparison of these results with 
those of Experiment 8 provides particularly 
striking evidence of the difference between 
the U and non-U conditions. The levels of 
recognition were much the same in the two 
experiments, but they differed markedly in 
matching. Coherence of recall is not a 
function only of perceiving the requisite 
data; the perceptual relations between the 
data are decisive. 

This variation establishes that the differ- 
ence between unitary and nonunitary stimuli 
holds under conditions of incidental learn- 
ing. 

Experiment 10: Recognition of Pairing. 
The procedure called for the recognition of 
the original unit or pair. This mode of test- 
ing, which reduces to a minimum depend- 
ence on recall, should provide the clearest 
evidence concerning the difference between 
the stimulus conditions at the point of trace 
formation. 

The experiment was done under group condi- 
tions, with 5 Ss in each group session. The stimuli 
were projected on a screen in a darkened room, as 


in Experiments 8 and 9. 

Familiarization with the forms and modes pre- 
ceded the step of pairing. Here, we followed in 
all details the procedure described in Experiment 7. 
‘A test of recall of the single forms and modes fol- 
lowed the familiarization trials. 

Thereupon, we proceeded to the joint presenta- 
tion of forms and modes, under conditions iden- 
tical with those prevailing in Experiment 5, except 
for the projection of the stimuli. The Ss were as- 
signed to the two main conditions, and were given 
intentional instructions appropriate to the U or 
non-U series. An illustrative stimulus during the 
instructions demonstrated the task in each condi- 
tion. The test of recognition, which will now be 


20 


described, came 3 min. following the exposure of 
the last stimulus. 

a. Recognition test. For this test each form was 
reproduced on a separate page with 5 different 
modes. For the U condition, the form was drawn 
with each of the 5 modes on the contour; for the 
non-U condition, the continuous form was paired 
with each of the same 5 modes. One of the five 
pairings of form and mode on each page was cor- 
rect. The two sets of 5 modes were selected so as 
to be of approximately equal difficulty, on the basis 
of the data of Experiment 6. (Each form was 
paired with 5, instead of with the entire set of 10, 
modes in order to reduce interference at the time 
of test.) There was a total of 10 pages, one for 
each stimulus. 

The order in which the stimulus appeared in the 
recognition booklet varied from S to S, as follows. 
The recognition stimuli were divided into five sets 
of two each, They were rotated in pairs, so that, 
for some Ss, each pair was the first in the recog- 
nition series. This procedure will permit us to 
determine the level of recognition of the entire set 
of stimuli when they appear early in the series, ог 
before interference due to the method of testing 
has set in. 

There remains to be mentioned one further detail 
of importance. The order of the stimuli was also 
shifted during learning, and in a fixed relation to 
the positions they had in the recognition series. 
The relation of the orders was as follows: 
Recognition Order 123.5678 9 10 
Presentation Order 47 6 9 158 10.32 
We call attention particularly to the fact that the 
first stimulus in the recognition series was always 
fourth in the learning series, and the second was 
always seventh in the learning series. The first 
pair of stimuli was thus by no means in positions 
most immune from interference during learning. 

_ The recognition stimuli described above were 
lithographed and prepared as booklets consisting of 
10 pages. The Ss were instructed to check the 
correct pairing on each page; they were not al- 
lowed to return to an earlier page subsequently or 
to correct an earlier choice. 

The Ss were 50 women undergraduates at Doug- 
lass College, Rutgers University, of whom 25 were 
in the U condition and 25 in the non-U condition. 


b. Results. Recall of the separate modes 
and forms following familiarization was 
identical in the U and non-U groups. The 
total mean recall score was 18.8 in each 
group. This value is closely similar to those 
obtained in Experiment 7; again, the recall 
of modes was slightly superior. 

Recognition of pairing was substantially 
and significantly more accurate in the U 
group. Ss in the U group correctly recog- 
nized 77% of the stimuli ; the non-U group 
recognized 47% of the stimuli (р = .00005). 
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The median of correct U recognitions was 
9. of non-U recognitions 5. The range of 
correct U recognitions for the several stim- 
uli was 14—24, of non-U stimuli 7-15. Each 
U stimulus was superior in recognition to 
the corresponding non-U stimulus. Figure 
4, which summarizes the last result, plots 
the recognition scores of the stimuli in both 
conditions. : 

Further analysis showed that the position 
of the stimuli in the recognition test affected 
accuracy. Comparing the first and last two 
positions, we obtain mean scores in the U 
series of 88 and 72%, respectively ; the cor- 
responding non-U scores were 48 and 40%, 
respectively. 

It is of particular interest to establish the 
level of recognition of the first stimulus 1n 
each recognition test. Since the stimuli 
were rotated in pairs, there were only five 
different stimuli in the first position. In the 
U series, these stimuli were recognized in 
96% of the cases (by 24 out of 25 Ss) ; the 
corresponding value for the non-U stimul! 
was 52%. It seems noteworthy that recog- 
nition of the first stimuli in the recognition 
series was nearly perfect in the U series, 
particularly when we consider that they 0C 


— u GROUP (N*25! 


——— non-u GROUP (25 


NUMBER OF CCRRECT RECOGNITIONS OF PAIRS 


STIMULI 


г нату 
Fic. 4. Recognition of unitary and nonunit р 


pairs. This figure plots the number of corte с 
recognitions of each stimulus in the unitary ac] 
nonunitary series. The stimuli were ordere 1 165, 
cordance with decreasing scores in the U i | 
The stimuli are numbered as in Figure 3. 
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cupied the fourth position in the learning 
series, one that does not confer immunity 
from interference. We will consider the 
possible significance of this finding subse- 
quently (see p. 38). 

Figure 4 shows a definite correspondence 
between ease of recognizability of the indi- 
vidual stimuli in the two conditions. The 
rank order correlation between the U and 
non-U series was .70. Despite the systematic 
differences between the two conditions, the 
difficulty of pairing a given stimulus re- 
tained a substantial constancy. 

These findings establish that the advan- 
tage of stimulus unity persisted when the 
contribution of recall proper was reduced to 
a minimum. We take this as evidence that 
stimulus unity favored the formation of 
functional connections. 


Tue Ficure-GrounD RELATION 


We have studied the role of two rela- 
tions: of constitution and of the pairing of 
spatially separated heterogeneous units. The 
former we took to be an instance of unity; 
the latter, of duality. As mentioned earlier, 
there are most likely other sources of unity 
and duality. What follows is an attempt to 
explore the relevance to the present investi- 
gation of another relation—that of figure 
and ground. | 

The role of the figure-ground relation in 
the formation of associations has not been 
studied or raised to the level of a problem. 
The reason is a paradoxical one. There has 
been no realization that, considered as a per- 
ceptual situation, the classical association 
experiment has а highly particular struc- 
ture. The Stimuli of the association experi- 
ment are given as pairs; pairedness is estab- 
lished by means of contiguity. Analysis has 
failed to note the following regular property 
of the association experiment: the paired 
stimuli have been universally of coordinate 
rank, each having the status of figure on the 
same ground. 

The pertinence of the figure-ground rela- 
tion to the present inquiry is quite clear. 
Figure and ground constitute a unit, but 
they are segregated subparts, differing in 
phenomenal and functional characteristics. 


A study of the coherence between figure 
and ground should also throw light on a 
question arising from the preceding experi- 
ments. The unitary and nonunitary stimuli 
of the preceding section differed with re- 
spect to spatial proximity: form and mode 
coincided in locus in the one case, but were 
separated in the other case. Therefore, the 
question arises whether the results obtained 
were a function of spatial proximity alone. 
An investigation of the role of the figure 
and ground, which involves a change of 
proximity relations, should clarify this ques- 
tion, 


Experiment 11: 
and Ground 

The general technique of this investiga- 
tion can be readily adapted to the study of 
coherence between figure and ground. The 
forms of Experiment 5 were drawn in 
thick continuous contour. The modal prop- 
erties filled the entire background, includ- 
ing the inner area of the form. Figure 5 
provides an illustration. 

The instructions, as in the previous experiments, 
were to study each card so as to be able to repro- 
duce the form and its background, The conditions 
of learning were identical with those of Experi- 
ment 5. A test of free recall was employed. There 
were 24 Ss, 12 men and 12 women, The results 
appear in Table 12. 

]. The mean level of recall, of both men 
and women, was somewhat but not signifi- 
cantly lower than in the U and non-U con- 
ditions of Experiment 5 (see Table 4). The 
decrease was greater for men than for 


women. 


The Relation of Figure 
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Fic. 5. A sample stimulus for the figure-ground 
condition. 
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TABLE 12 


RECALL SCORES OF FIGURE—GROUND STIMULI 
Experiment 11: Free Recall Test 


(№ = 24) 
1 2 3 4 5 
Cor- Incor- | Recall | Recall| Mean 
rectly | rectly of of of 
Paired | Paired | Form | Mode | Total 
Recall | Recall | Alone | Alone | Recall 
+(ЕМ) | - (FM) 
A 3.3 2.3 1.8 145 8.8 
| 
% 37.1 25.1 20.5 16.7 


2. Correct pairings were significantly 
lower than in the U condition of Experi- 
ment 5 (p = .006), and insignificantly dif- 
ferent from the non-U condition of Experi- 
ment 5 (see Table 4). The values of the 
index of pairing accuracy (see Table 6) 
place this condition unambiguously with all 
others that were nonunitary. 

We conclude that the figure-ground or- 
ganization acted as a segregating condition 
for purposes of association. 

The results also have a bearing on the 
role of proximity, the investigation of which 
was another object of this variation. The 
proximity of form and modes approximated 
to that prevailing in the unitary condition, 
while differing markedly from the non- 
unitary condition. Nevertheless, the results 
were near to the latter, while departing 
strongly from the former condition. While 
this finding does not prejudge the general 
issue of the role of spatial proximity in as- 
sociation, it does suggest that the variations 
of proximity employed in the present in- 
vestigation were without effect. The conclu- 
sion follows that the differences obtained 
between the unitary and nonunitary series 
are traceable to other sources, and that the 
relations of proximity were effective only 
in so far as they mediated other relations. 

Further procedures suggest themselves 
that might clarify this first result concerning 


the role of figure-ground articulation. One 
obvious modification of the present proce- 
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dure would be to exclude the modal stimuli 
from the inner area of the contour. It 
would be of particular interest to go one 
step further, and eliminate the continuous 
contour altogether, as illustrated in Figure 
6. Part of this stimulus distribution is 
identical with the constitutive unitary stim- 
ulus, but the results should continue to 
diverge, in consistency with what we have 
found. 

It would also be instructive, in further in- 
vestigation, to fill the area enclosed by the 
contour with one set of modes, and the 
outer ground with a different set of modes. 


Tue RELATION OF INCLUSION 


We now turn to another relation—to the 
inclusion of one visual form within the con- 
tour of the other. It is worth considering 
in its own right ; in addition, it should throw 
further light on the role of spatial proximity 
under the present conditions. Finally, we 
will see that this variation stands in an in- 
teresting relation to Experiment 11. 
Experiment 12: The Included Condition 


The forms of Experiment 5 were drawn 
in continuous outline, and only the inner 
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Fic. 6. A further figure-ground condition. 


PERCEPTUAL CONDITIONS OF ASSOCIATION 


Fic, 7. Stimulus series for the included condi- 


tion, 
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Results. Total recall was somewhat, but 
not significantly, less than in Experiment 5, 
as Table 13 shows. Accurately paired re- 
calls were 51.796 of all recalls, a value ex- 
actly between the U and non-U levels of 
Experiment 5 (see Table 4). It might there- 
fore appear that the procedure of inclusion 
created an intermediate condition. In what 
follows it will be seen that this would be an 
incorrect conclusion. 

The results contained a complicating cir- 
cumstance. There were considerable differ- 

ences among the stimuli in accuracy of joint 
recall. An examination of the stimuli raised 
the suspicion that they were not phenomen- 
ally of the same kind, although they had 
been prepared in accordance with a con- 
sistent and prescribed procedure. Some of 
the included modes appeared to "cohere" 
with their form far more than others. If 
this surmise should be correct, the stimuli 
did not constitute a homogeneous set. 

То check on this possibility, we proceeded 
as follows. Six judges, two instructors in 
psychology and four graduate students, 
were shown individually the set of 10 stim- 
uli spread out in a random (and constant) 
order. The judges, who had no knowledge 
of the experiment, were asked to select one 
stimulus the inner area of which "cohered" 
with the form, and another that lacked such 
coherence. The instructions did not attempt 
to define the basis of the division beyond the 
statement that form and mode "went better" 
together in some instances than in others. 


TABLE 13 


RECALL SCORES OF INCLUDED STIMULI 
Experiment 12: Free Recall Test 


(N = 24) 
1 2 3 4 5 
Сог- Incor- | Recall| Recall| Mean 
rectly | rectly of of of 
Paired | Paired | Form | Mode | Total 
Recall | Recall | Alone | Alone | Recall 
+(FM) | —(FM) 
M 4.4 1.6 1.6 1.0 8.5 
% 51.7 18.5 18.5 | 11,2 
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Thereupon, the judges were requested to 
divide the entire set into two equal halves 
on this basis. Finally, the judges ordered 
the stimuli in each half according to degree 
of coherence and lack of coherence, respec- 
tively. We thus obtained from each judge 
a ranking of the entire set of stimuli on a 
dimension of "coherence." 


We scored the stimulus selected by each 
judge as most coherent 10, the next 9, and 
so on. The scores assigned to each stimulus 
by all six judges were then summed. Thus, 
we obtained an ordering of the stimuli in 
accordance with a phenomenal criterion of 
coherence. 

There was a very substantial agreement 
among the judges. The mean rankings of 
the stimuli, in the order in which they are 
reproduced in Figure 7, were: 2.0, 7.7, 3.7, 
3.0, 62, 87, 7.5, 7.7, 27, and 6.0. 

We then divided the stimuli into two 
halves of five each, in accordance with their 
mean rankings, and calculated the respective 
recall scores. Let us call the upper half “со- 
herent” (or C), and the lower half “not co- 
herent” (or —С). (The С stimuli were: 2, 
5, 6, 7, and 8; see Figure 5.) For purposes 
of comparison we grouped similarly the 
scores of the corresponding stimuli in Ех- 


periment 5, both in the U and non-U condi- 
tions. The results appear in Table 14. 

There was an impressive correspondence 
between the recall scores and the—inde- 
pendently obtained—judgments. First, the 
total recall of stimuli judged coherent was 
significantly superior to the recall of the less 
coherent stimuli (see Table 14, row 5). 
Further, this superiority was almost com- 
pletely due to greater accuracy of joint 
recall. Correctly paired recalls comprised 
67.2% of all recalls among the C stimuli, 
and 27.5% among the others. By the Wil- 
coxon test, the difference in absolute num- 
ber of pairings is significant at far beyond 
01. The difference is also significant when 
the proportion of correct pairings to total 
recall in every category is the score; by the 
sign test this difference is significant at be- 
yond .002. 

For comparison we have grouped the re- 
sults of Experiment 5 similarly. Since the 
basis of division is irrelevant to this varia- 
tion, it is not surprising that there were no 
appreciable differences between C and —C 
stimuli. Indeed, accuracy of pairing of —C 
stimuli tended to be somewhat higher in 
Experiment 5. 


The principal finding is the close resem- 


TABLE 14 


CoMPARISON OF COHERENT AND NONCOHERENT STIMULI 


Experiment 12 Experiment 5 
Included Stimuli Unitary Stimuli | Nonunitary Stimuli 
timull 
(N = 24) (N = 24) (V ul 

2 = s =c с =e 
1. Mean Correct Pairings: +(FM) 3.5 0.9 2.9 за Р 1.8 
2. Mean Incorrect Pairings: — (FM) 0.7 0.9 08 5 ү, НЕ 
3. Mean Recall of Е Alone 0.5 1.0 10 02 i ae 
4. Mean Recall of М Alone 0.5 0.5 01 "s an 0.7 
5. Mean Sub-total Recall 5.2 3.3 48 re il pi 

6. Mean Total Recall 8.5 бй аы 
7. Percentage of Correct Pairings 67.2 21.5 6.9 ^ 70.4 TS 39.3 
8, Percentage of Incorrect Pairings 13.6 26.2 17.4 10.3 33.1 34.6 
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blance, with respect to accuracy of pairing, 
between the C stimuli of the present experi- 
ment and the corresponding U stimuli of 
Experiment 5, and the resemblance between 
the —C stimuli of the present experiment 
and the corresponding non-U stimuli of Ex- 
périment 5.!* The values of the index of 
pairing accuracy (see Table 5) offer further 
convincing support for this statement. 

Includedness per se is thus not a condi- 
tion favoring joint recall. This conclusion 
receives support from an earlier unpublished 
experiment by Celeste McCollough and the 
writer. The association between pairs of 
heterogeneous nonsense figures was com- 
pared when they were spatially separated 
and when one appeared inside the contour 
of the other. No differences of association 
were found between these conditions. 

Again, the results bear on the role of 
proximity. With proximity (of form and 
mode) equalized, there were nevertheless 
marked and consistent differences between 
the two subgroups that were discriminated 
on a phenomenal basis. Proximity clearly 
cannot account for this result. Further, we 
have also found that, with respect to pair- 
ing, the C stimuli of this series were similar 
to the U stimuli of Experiment 5, and that 
the —C stimuli resembled the non-U stimuli. 
This result suggests that the differences in 
pairing between the U and non-U stimuli 
Were also not a function of proximity. (For 
а further discussion of this point see pp. 
31, 34.) 

We also note that, as a consequence o 
the relation of inclusion, all stimuli of this 
experiment were phenomenally of the ' p 
Variety. If so, membership in a $18 
Perceptual constellation is not à soe 
that favors pairing. We need to Lue me 
between the property of “singleness an 
the quality of unity. The difference eee 
the unitary and nonunitary stimuli of in 
Carlier experiments is therefore not ade- 
quately described in terms of the difference 
—— 

14 As mentioned above, 
differed also in general lev 
No explanation for this гези 
dition may have accentuated t 
the two subsets. We found some ev 
Same trend in Experiment 4. 
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between “опе” and "two" either. Indeed, it 
can be readily shown that neither proximity 
nor numerical unity are decisive for figural 
unity. 

What, then, is the perceptual difference 
between the stimuli we have called “со- 
herent" and "not coherent"? To us it 
seemed that the form and contour of the 
stimuli judged coherent participated in the 
formation of a unitary surface. In these 
cases the filled area was seen as belonging 
to the form. In the other cases the inner 
area was seen simply as filled with forms of 
a particular kind; the contour and the en- 
closed shapes had distinct identities. If this 
interpretation is in the right direction, the C 
stimuli were phenomenally and functionally 
unitary, in contrast to the —C stimuli, and 
the present results provide independent evi- 
dence concerning the effect of perceptual 
unity on pairing. 

It is a question for separate investigation 
to establish the stimulus conditions of sur- 
face formation in this case. The present 
study provides only tentative hints. It 
seemed to us that the less individual the 
modes were as forms in their own right, the 
better they lent themselves to this effect; 
thus, dots and stippled areas were judged 
predominantly coherent. It is also probable 
that the size of the included forms and the 
uniformity of their distribution, as well as 
their figural relation to the including form, 
play a part. 

It also follows that the specific sources of 
phenomenal unity are diverse. According to 
the interpretation here offered, in the case 
of the C stimuli, includedness mediated a 
particular organization, namely, the forma- 
tion of a surface, just as the relation of con- 
stitution mediated the phenomenal unity of 
the stimuli of the preceding experiments. 
It would appear that the complete spatial 
coincidence of two properties is one condi- 
tion of unit formation. Correspondingly, 
nonunity, under certain conditions, may be 
achieved by spatial separation, and also by 
failure of surface formation. 

The present experiment stands in an in- 
teresting relation to Experiment 11. Viewed 
in a certain way, they are identical in this 
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respect: both have continuous forms with 
modes filling the inner area. The difference 
between them is equally clear: in the present 
experiment the modes were confined en- 
tirely within the contour, whereas in Ex- 
periment 1l they extended into the back- 
ground. The following comparison between 
them seemed, therefore, worth attention. 

We have just identified two subsets of 
stimuli—coherent and not coherent—that 
differed markedly. Let us now observe the 
performances on the corresponding subsets 
of the figure-ground condition. The results 
appear in Table 15. 

The difference between coherent and not 
coherent stimuli disappears completely when 
the modal components extend into the 
ground. The finding that the interaction be- 
tween a contour and included modes is 
abolished when the latter are also part of 
the ground makes good sense from the 
standpoint of perception. When the modes 
are part of the ground, those falling within 
the contour lose their specific relation to it, 
and become a mere continuation of the 
ground. Thus, we have established a differ- 
ence between the relation of inclusion and 
the figure-ground relation. 

This investigation has provided further 
confirmation of the role of stimulus unity. 
The difference between the more and less 


unitary stimuli corresponded to those of the 
earlier experiments. The specific grounds of 
unity were different in the present setting, 
but the consequences for joint recall were 
quite similar. 

Experiment 13: 
cluded Condition 

As a check on the preceding results, 
Experiment 12 was repeated with a new 
pairing of forms and modes. The forms 
and modes were those of the preceding 
variation, but the pairings were now those 
of Experiment 1. 

The procedure described above was fol- 
lowed without change. Again, the stimuli 
were judged by a (new) group of 6 judges 
for “coherence.” The Ss were 24 Swarth- 
more College students, 12 men and 12 
women. 

The results were in all essentials similar 
to those of Experiment 12. The new sets of 
C and —C stimuli differed markedly in total 
recall and in accuracy of joint recall. By 
the Wilcoxon test the difference in absolute 
number of pairings was significant beyond 
.01. Correct pairings accounted for 70.8% 
of all C recalls, for 46.5% of all —C recalls. 
The ratio of correct pairings to the total re- 
call score of each 5 was also computed for 
each $ in each category ; by the sign test, Р 
= 03. 


4 Variation of the In- 


TABLE 15 


COMPARISON OF COHERENT AND NONCOHERENT STIMULI 


Experiment 11 Experiment 12 
Figure-Ground Stimuli Included Stimuli 
(N = 24) (N = 24) 

c -C c Eo 

1. Mean Correct Pairings: +(FM) 1.4 | E 1.8 3.5 0.9 

2, Mean Incorrect Pairings: — (FM) 1.0 1.3 0.7 0.9 

3. Mean Recall of F alone 1.0 0.8 0.5 1.0 

4. Mean Recall of M alone 0.8 0.7 0.5 0.5 

5. Mean Sub-total Recall 4.2 4.6 5.2 3.3 

6. Mean Total Recall 8.8 | | 8.5 

7, Percentage of Correct Pairings 34.0 40.0 67.2 US ans 
(1 +5) : | 

8. Percentage of Incorrect Pairings 24.0 25.5 13.6 26.2 
Q + 5) : | 
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‘These results actually underestimate the 
difference between C and —C stimuli. While 
the judges were consistent in differentiating 
between the extreme stimuli of this series, 
they were far more variable when it came 
to the 4 middle stimuli. The mean rankings 
of the latter were very close to each other, 
attenuating the effect. When we compared 
the three stimuli at each extreme, correct c 
pairings accounted for 70.676 of all recalls, 
and correct —C pairings accounted for 
37.396 of all recalls. The results substan- 
tially confirm the findings of Experiment 12. 

We have also asked whether the same 
modes and forms were judged coherent in 
Experiments 12 and 13. As might be ex- 
pected, the modes proved more decisive. 
Three modes (dots, dashes, and stippled 
areas) produced high judgments of coher- 
ence when they filled the areas of different 
forms. As mentioned earlier, the lack of in- 
dividuality of the modes, and the homo- 
geneity of their distribution, was probably 
responsible for the impression of areal 
homogeneity, and consequently for surface 
formation. 

Experiment 14: A Further Variation 

The principal result of Experiments 12 
and 13, namely, the difference that emerged 
between stimuli judged "coherent and 
“not coherent,” is susceptible of another 
interpretation. We have seen that the 
former generally contained modes of rela- 
tively homogeneous distribution. This sug- 
gests the possibility that modes belong to 
"coherent" stimuli duplicated the shape of 
the contour. If so, the advantage they had 
may not have required the participation of 
the boundary line. The modes in question 
would then represent a modified version of 
the earlier U stimuli, while the —C stimuli 
would correspond to the earlier non-U 
condition. Consequently, the interpretation 
advanced above would be incorrect. The 


present experiment was designed as a check 


on this possibility. im 
: А xperi t 12 were modified as 
The stimuli of Experimen 
йоу. "Each card consisted = a M of жиг 
imuli, The stimuli left were the continuous 
stimuli. The stimuli at the x i 
е i at the right were 
forms of Experiment 12, those а 
the arrays of modes which belonged that 
The latter were grouped precisely in the way they 


were previously. Thus, the stimulus contents were 
identical with those of Experiment 12, except that 
the modes were now separated from the contour. 

The Ss were given free recall instruction. In 
addition, the instructions specifically called atten- 
tion to the similarity of shape between the terms, 
as the following statement shows: “Each card will 
contain two figures. One will be that of a simple 
outline form at the left; and the other will be that 
of a number of smaller forms grouped together at 
the right. It will help you in remembering the 
pairs later to notice that the smaller drawings at 
the right have an overall shape quite similar to the 
one at the left. I want you to look at the drawings 
carefully so that later you will be able to draw 
from memory as many of the pairs as possible." 
A free recall test followed. 

If similarity of form between contour and 
modes was responsible for the difference be- 
tween the “coherent” and “not coherent" 
stimuli of Experiment 12, this difference 
should also appear under the present condi- 
tions. If the difference between the two sets 
of stimuli should disappear, the conclusion 
would be in order that "coherence" in Ex- 
periment 12 was mediated by the surround- 
ing contour. 

The Ss were 7 men and 5 women students 
at Swarthmore College. The results, which 
appear in Table 16, contain the data for the 
stimuli which had been categorized in Ex- ` 
periment 12 as "coherent" and "not co- 
herent.” 

Mean recall was closely similar to that 
obtained in Experiment 12. The proportion 
of correct pairings to recall, for the entire 
series, was somewhat lower; there was also 
a higher level of isolated recalls. 

The C stimuli did show a trend toward 
more accurate pairing than the —C stimuli, 
but the difference was small and not signifi- 
cant; the effect was far weaker than in Ex- 
periment 12. There was no difference be- 
tween C and —C stimuli in inaccurate pair- 
ing, in contrast to the results of Experiment 
12. 

The results do not convincingly support 
the view that the advantage of the “coher- 
ent” modes is to be traced to their form 
quality, and that this accounts for the find- 
ings of Experiments 12 and 13. It should 
be added that the present variation may 
have been weighted too one-sidedly against 
the surface formation alternative. The pro- 
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cedure of separating the continuous contour 
from the mode aggregates, it seemed to us, 
enhanced the perceived form similarity be- 
tween them beyond what was the case in 
Experiments 12 and 13, in addition to which 
the instructions explicitly stressed this rela- 
tion of similarity. Nevertheless, given our 
lack of knowledge concerning the inter- 
action between a contour and its included 
modes, one must reserve a final conclusion. 

We can, however, draw the lesser conclu- 
sion that the factor of proximity was not 
responsible for the differences obtained be- 
tween the unitary and nonunitary stimuli of 
the earlier experiments. 


INTERPRETATION AND DISCUSSION 


We have studied an aspect of coherence 
that has not hitherto claimed attention. 
"Theory and investigation in this region have 
placed in the forefront the role of condi- 
tions such as temporal contiguity and fre- 
quency. This investigation was concerned 
with a different question, namely, with the 
role of different kinds of stimulus relations 
in establishing coherence. 


Accordingly, we compared the formation 
of functional connections between stimuli 
whose relations were varied in a number of 
ways. The stimuli were heterogeneous 
visual forms, and the relations between 
them were as follows: they were spatially 
segregated units; they belonged to the same 
visual contour, being in a constituent-whole 
relation; the boundaries of one surrounded 


the other; one had the function of figure, 
the other of ground. These conditions dif- 
fered in important ways. More particularly, 
they could be grouped on the basis of the 
memory data in two categories. One set of 
conditions produced markedly superior con- 
nections, the other was more resistant to 
their formation. Table 17 summarizes the 
findings for the entire range of conditions. 
The two major groupings include rela- 
tions of considerable diversity. We pro- 
posed that the basis of division was the 
property of unity, that unitary stimuli were 
superior in establishing functional coher- 
ence. ES 
What grounds are there for identifying a 
given stimulus distribution as unitary? To 
avoid circularity the decision, of course, 
must be independent of the memory data. 
We relied in the first instance on phenom- 
enal evidence. The division of the stimuli 
on a phenomenal basis was straightforward. 
In some of the instances studied, special in- 
quiry was unnecessary. Most, if not all, 
observers would agree to call a set of repeti- 
tive modes delineating a shape unitary, an 
the spatial separation of these from а con- 
tinuously outlined shape dual (as in Ex- 
periments 1-10) ; at least they would agree 
that the former are more unitary than the 
latter. There would be similar agreement 
that the figure-ground distributions in EX- 
periment 11 are less unitary than the forms 
delineated by modes. In other instances that 
were less clear, phenomenal judgments were 
systematically obtained for the purpose ; this 


TABLE 16 È 


COMPARISON OF COHERENT AND NONCOHERENT STIMULI 
Experiment 14: Free Recall Test 


1. Mean Correct Pairings: --(FM) 

2. Mean Incorrect Pairings: — (FM) 

3. Mean Recall of F alone 

. Mean Recall of M alone 

‚ Mean Sub-total Recall 

. Mean Total Recall 

7. Percentage of Correct Pairings (1 - 5) 
8. Percentage of Incorrect Pairings (2 + 5) 


ane 


с =E 
2.2 1.3 
25, 3 
1.0 1.4 
e :9 
4.4 4.0 
8.4 
49.1 33.3 
11.3 8.3 
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TABLE 17 
Summary TABLE OF RECALL SCORES OF UNITARY AND NONUNITARY STIMULI 
(in %) 
Unitary Stimuli Nonunitary Stimuli 
Correctly Incorrectly А Correctly Incorrectly 
Experiment Paired Paired Experiment Paired Paired 
Recall Recall Recall Recall 
1 76.2 7.4 1 49.2 31.4 
2 77.8 22.2 2 40.7 59.3 
3 74.6 25.4 3 48.5 51.5 
4 88.7 0.9 4 29.3 37.8 
5 65.5 13.8 5 38.1 33.8 
12:С 67.2 13.6 12:—C 27.5 26.2 
11 37.1 25.7 


Note.-The experiments included in this table employed the method of free recall, except for Experiments 2 and 3, which 


employed paired-associate recall. 


was the case with the inclusion of modes in 
the area bounded by a contour (see Experi- 
ments 12 and 13). № 

Since the objective stimulus properties in 
most cases can be specified exactly, it is pos- 
sible to relate them to phenomenal unity. 
Accordingly, we can say (a) that the rela- 
tion of constitution is a condition of phe- 
nomenal unity, (b) that the spatial separa- 
tion of nonsimilar forms is a condition of 
duality, as is also (c) the presence of heter- 
geneous forms in the relation of figure and 
ground. The stimulus conditions adequate 
to surface formation, as we have seen, await 
further investigation. 

Implicit in this discussion is the assump- 
tion that unity (and duality) has its source 
in different specific conditions, some of 
which we have doubtless not mentioned. 
The question arises whether the general 
category of unity, when applied to instances 
as different as those here considered, and to 
others that were not considered, has a con- 
crete content. It is not possible to specify 
at present an objective ws ant 

11 instances experien - 
A. dei, respectively. Would it there- 
fore not be simpler to describe the results 
in terms of the objective characteristics of 


the stimulus and to forego reference to a 
property that can only be identified phe- 


nomenally? Such a procedure would leave 
entirely unexplained the fact that each set 
of diverse stimulus relations produced con- 
sistently similar results. Although the 
grounds of phenomenal unity require fur- 
ther investigation, they should not for that 
reason be ignored. It may well be that the 
phenomenal evidence is in advance of what 
we are at present able to formulate in other 
terms. 

A further question will have to be faced 
in connection with the property of unity. 
There may be grounds for holding that any 
two stimuli, however heterogeneous, consti- 
tute a unit, when they are the only items in 
the field. Another way of stating the same 
point would be to consider that any pair of 
units may form a single pattern. It would 
follow that the stimulus conditions we are 
contrasting are all unitary, and that they 
differ mainly in degree of unity. If so, the 
question we have put would concern differ- 
ences between more or less unitary condi- 
tions, or between units differing with respect 
to the autonomy of their subparts. It is of 
considerable importance to establish whether 
the difference in question is only one of de- 
gree, and we will consider this point subse- 
quently (see pp. 39-43). However this is- 
sue is decided, the main outcome of this 
investigation remains unchanged. It has 
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demonstrated that the coherence of data in 
memory is a function of their unit relation. 

The findings are restricted to a particular 
modality—the visual—and to a limited set 
of properties within the modality, and there- 
fore need to be verified in other settings. 
But they provide the basis for a general 
conclusion: whenever we can specify the 
relative unity of two stimulus distributions, 
we can predict their relative coherence. 

In what follows we will explore further 
the conclusion just stated, including its bear- 
ing on the nature of associations. 


Alternative Interpretations 


The procedure we have followed was to 
relate differences of recall to given stimulus 
conditions. It is necessary, therefore, to 
establish whether stimulus properties other 
than those we have identified might have 
been responsible for the findings. This ques- 
tion has concerned us earlier; we will now 
consider it more systematically. 

1. One obvious requirement of this study 
is that the respective stimulus conditions be 
equally perceptible or discriminable. In the 
light of the results already reported, this 
requirement can be modified as follows: 
differences in perceptibility must not favor 
the conditions that are superior in coher- 
ence. Experiment 6 demonstrates, we be- 
lieve, that this requirement was met. There, 
we employed recognition (of forms and 
modes, respectively) as an index of percep- 
tibility. (a) The main finding was that the 
nonunitary stimuli (in particular the modes) 
were somewhat more recognizable. This cir- 
cumstance should favor the joint recall of 
nonunitary stimuli. Nevertheless, the uni- 
tary condition was superior with respect to 
joint recall. (b) It might be maintained 
that the issue concerns the perceptibility of 
the entire stimulus, or the perceptibility of 
the given relation. For example, was the 
constituent-whole rclation more easily dis- 
criminable than the relation of adjacency 
between heterogeneous forms? Such thres- 
hold differences could determine the mem- 
ory results. In this connection we would 


15 The writer is grateful to E. T. Gibson for call- 
ing attention to this point, 


cite the "synthetic pairings" of Experiment 
6, or the instances of joint recognition of 
the form and mode of a given unit or pair 
(see р. 14). Such joint recognition was 
more frequent in the nonunitary series. We 
take this result to mean that the threshold 
of perceptibility was lower in the non- 
unitary series. (c) Experiment 7 provides 
evidence in the same direction. The proce- 
dure of familiarization there employed 
raised the recognizability of the forms and 
modes to a maximum and also equalized it 
for the two conditions. The persistence 
(and increase in magnitude) of the experi- 
mental effect under these circumstances is 
convincing evidence that differences of dis- 
criminability were not responsible for the 
results. 

2. The rationale of this study requires 
that the conditions be equivalent in content, 
or in "amount" of information. No finding 
is more clearly established than that it 1$ 
more difficult to master more material than 
less. When two otherwise similar tasks 
differ in case of mastery, the question im- 
mediately arises whether they differ in 
"amount to be learned." 

а. The assumption of such a difference 
may take the form of asserting that the 
unitary series consisted of single stimuli, 
and the nonunitary series, of pairs; or that 
the latter contained twice as many stimuli 
as the former. This formulation begs the 
question. It ignores the fact that each con- 
dition contained forms and modes, that these 
were heterogeneous, and that one could not 
be deduced from the other in either condi- 
tion. The assertion is actually an outcome 
of the correct intuitive inference that the 
unitary condition is the easier, and of the 
unexamined assumption that the responsible 
difference is one of amount of material. 

b. The following is a more serious Уе" 
sion of the same point. The nonunitary 
stimuli consisted of pairs, each with its 
form and mode. The a term was a particu” 
lar form delineated by a continuous mode’ 
the b term also had a particular shape, e. 
of a straight line, and was composed = 
modes. Thus, each pair contained two for? 8 
and two modes. The unitary stimuli, how 


| 
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ever, eliminated the continuous mode and 
the linear shape. 

Two points might be made in answer. 
First, the nonunitary Ss knew from the 
start that the stimuli would consist of con- 
tinuous forms paired with lines; they did 
not need to learn about this feature of the 
task. It must be admitted, though, that this 
is not a conclusive reply; the fact remains 
that the nonunitary series contained data 
absent from the unitary series. Second, and 
more weighty, is the consistent finding that 
the conditions were very similar in content 
recall. This result goes counter to the inter- 
pretation that a quantitative difference of 
material was responsible for the observed 
effect. 

3. A more plausible objection is that the 
unitary stimuli were less homogeneous, and 
therefore less vulnerable to mutual interfer- 
ence. Each condition had a series character. 
In particular, the nonunitary set contained 
two well-defined series—continuous forms 
and straight lines. The members of each of 
these series, by virtue of similarity, may 
have been more exposed to interference 
than those of the unitary series. Although 
this homogeneity did not impair recall of 
content, as we saw earlier, it is quite pos- 
Sible that it contributed to associative 
Interference. 

"It was in this connection that we per- 
formed a control experiment employing the 
Procedure of familiarization (Experiment 
This experiment provided eal = 
dence that differences of intraseria 29 
terence do not account tor the 2680 tS 
Pamiliarization raised the recall of — 
and modes to a near maximum prior to че 
main task, equalizing the availability E^ E 
items in the respective conditions. => 
could only have the effect of fores si ШЕ 
importance of any differential merirem 
at the time of learning. If interference en 
been responsible for the results, differences 
between the two series should have been — 
duced. The opposite was the Ree, = 
Should be added that if we had obtaine ev 
dence for the differential role of piecer 
ence, this would not have necessarily A 
Solved the issue. Tt would still remain to be 


decided whether the interference in ques- 
tion was a cause of the experimental effect, 
or a consequence of the unit relation.) 

Here also should be mentioned the ex- 
periments involving the relation of inclu- 
sion. The presence in the series of dense 
fields of modes, in accordance with the 
argument, should have made for very con- 
siderable interference; the clustered modes 
were as homogeneous as their counterparts 
in the nonunitary series of the earlier ex- 
periments. The results support the interpre- 
tation that it was the relation of form to 
modes which exerted the decisive effect. 

Finally, it seems appropriate to caution 
against too ready an acceptance of the rele- 
vance of an interference interpretation for 
data such as these. It seems to us more 
tenable that the poorer performance in the 
nonunitary condition was a direct conse- 
quence of the greater resistance these stim- 
uli offered to the formation of associations, 
and possibly of their lower resistance to 
disintegration. 

4. Unitary and nonunitary stimuli dif- 
fered with respect to the spatial proximity 
of forms and modes. If we had to rely on 
the data of Experiments 1 to 5 alone, it 
would be a tenable conclusion that spatial 
proximity was responsible for the results 
obtained. Accordingly, we altered the prox- 
imity of forms and modes in Experiments 
12 and 13. Although these variations equal- 
ized the proximity relation for all members 
of the series, they produced two sets of 
stimuli differing appreciably in pairing. One 
of the sets gave pairings quite comparable 
to those obtained with the nonunitary stim- 
uli, despite the evident differences between 
them in proximity. Finally, the results of 
the figure-ground variation of Experiment 
11 resembled closely those found with the 
nonunitary stimuli, although the two again 
differed in proximity. 

We thus found that conditions identical 
in spatial proximity differed in joint recall 
and that others differing in proximity were 
equivalent in this respect. Proximity cannot 
account for these results. Whatever the 
effect of proximity per se might be, other 
circumstances must be in large part respon- 
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sible for the present findings. The interpre- 
tation we propose is that spatial proximity 
was of consequence in this investigation 
only insofar as it affected the property of 
unity, and that differences of proximity 
were otherwise without appreciable effect. 


5. A related question is whether the con- 
trasted conditions differed with respect to 
the temporal character of the stimulation. 
In the unitary case form and modes were 
most likely perceived simultaneously or 
nearly simultaneously, but they may have 
been apprehended in a more successive way 
when they were spatially separated. Indeed, 
one finding reported earlier may seem to 
point in this direction (see Experiment 3a), 
and to suggest that a division of attention 
played a part when the stimuli did not oc- 
cupy the same position. 

Differences of temporal stimulation, we 
believe, cannot account for the results. 
First, in the included and figure-ground 
conditions, form and modes occupied the 
same regions, a circumstance that should 
have reduced successiveness, but the results 
do not go in the presumed direction. Sec- 
ond, there is no convincing reason for the 
belief that present differences of temporal 
Successiveness could be responsible for the 
order of effects obtained in experimentation 
of this kind. Finally, successiveness of ap- 
prehension, to the degree it played a part, 
may itself have been a consequence of the 
perceived lack of unity. 

6. It is reasonable to ask whether unitary 
stimuli lent themselves more easily to mne- 
monic devices. A unitary stimulus might 
be more likely to resemble familiar objects. 
Also, it might be verbally describable in a 
more concise way than a pair of stimuli; 
thus, the designation "dotted circle" is per- 
haps easier to recall than "circle and dots." 
1f so, the experimental effect would be only 
indirectly an effect of the perceptual prop- 
erties in question, and a direct consequence 
of the intervening verbal operations.!* 


16 One may ask, though, whether the assumption 
that the one verbal phrase is more effective than 
the other does not presuppose the very principle 
that is in question. 


Our observations showed that labelling 
did occur with some frequency, though by 
no means uniformly. We have no evidence 
that the conditions differed systematically in 
the manner suggested above; it is unlikely 
that this was the case. (a) Perhaps the 
most convincing evidence against this inter- 
pretation comes from Experiment 9, which 
employed a procedure of incidental learn- 
ing. The latter should have decreased the 
use of labelling in both conditions, and con- 
sequently the magnitude of the experi- 
mental effect. This was not what we 
found. (b) This interpretation cannot 
readily account for the differences between 
the "coherent" and "not coherent" sub- 
groups of Experiments 11 and 12. 


7. Finally, it may appear that the findings 
can be interpreted in terms of Thorndike’s 
factor of “belonging.” It is clear to us that 
this is not the case. The nonunitary condi- 
tions of this study conformed completely to 
the requirement of belonging, in the sense 
of Thorndike, fully as much as the unitary 
conditions. Thorndike (1932) restricted be- 
longing to the realization that "this goes 
with that," expressly asserting that it re- 
ferred to "nothing logical, or essential, or 
unifying... ." (p. 72). It would therefore 
be confusing to identify belonging with the 
relation of unity. 

Let us summarize the central issue of this 
section. We have found that a unitary stim- 
ulus joins its parts more intimately, or that 
one can go, in recall, from one aspect of it 
to another more easily than one can from 
one unit to another. What, we may ask, is 
the selective principle responsible for this 
advantage? Is unity per se a condition of 
coherence? Is it an irreducible datum, OF 
can its effects be derived from other de- 
terminants? More particularly, can these 
effects be referred to conditions that at 
only concomitant with unity, and whic? 
themselves lack organizational character’ 
Had we demonstrated that these other co?" 
ditions could account for the results, {© 
significance of the investigation would have 
shifted appreciably. To be sure, it pd. 
still be a matter of interest to have show * 
that unity implies the presence of Ot 
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characteristics important for learning and 
recall. But in that case the relation of con- 
sequence would be between the data of re- 
call and properties such as intraserial homo- 
geneity or perceptibility; the condition of 
unity would have become merely an index 
of the presence of these other properties. 

It is hardly our purpose to suggest that 
the consequences of unity are not suscepti- 
ble of further analysis. The preceding ex- 
amination demonstrates that certain particu- 
lar interpretations are not valid, and raises 
the presumption that the search for an 
answer will have to take relational prop- 
erties of the stimulus into direct account. 


Sources of Perceptual Unity 


We would like to understand better the 
grounds for the fact that a unitary stimulus 
joins its parts more intimately, thus produc- 
ing characteristic effects in memory. An 
answer requires a more detailed study of 
the stimulus. The following is an attempt to 
re-examine the stimulus relation that was 
here most extensively studied, that of form 
and mode on a contour line. 

It should first be understood that the re- 
lation of form and mode is not an instance 
of any of the grouping principles d 
by Wertheimer. These principles Cg 
entirely to what we have here called t е 
property of form. To be sure, Wertheimer 
often illustrated these by means of patterns 
of dots, the latter functioning as modes. His 
concern, however, was solely with the par- 
ticular grouping that was realized out of the 
alternatives that were theoretically possible. 
For this Purpose the character of the modes 
Was of hardly any account. In contrast, the 
question we have raised concerns precisely 
the relation between the form that 15 gen- 
crated by modes that “carry” the form. 

It is also important to observe that the re- 
lation of form to mode is not a usual in- 
Stance of a part-whole relation. A mode is 
Not part of an encompassing form т the 
Same sense that a side is part of a square. 

his follows from the fact that the modes 
Contribute to the form primarily by virtue 0 
their position; they have the function 0 
Points in generating a form. The specific 


shape of the mode is far less relevant for 
this purpose. This can be easily demon- 
strated. Different modes can serve to deline- 
ate the same form (see Figure 1) ; in these 
cases there is no difficulty in identifying the 
constant form. Conversely, the same modes 
can delineate different forms; and again 
one notes readily the identity of modes and 
difference of forms. й 

We have called the relation of form and 
mode one of constitution, since the modes 
are strictly constituents of the form. The 
relation is also one of dependence; one as- 
pect of the stimulus requires the other. It is 
probable that stimuli identified as unitary 
are as a rule dependently related, either in 
the sense that one cannot occur without the 
other, or because they alter each other. In 
that case the distinction between a unit and 
a duum would be clearly defined: the latter 
would involve nondependence and absence 
of (or extremely weak) mutual modifica- 
tion. 

At this point we may venture a further 
step, based on the observation that form and 
modes were identical in locus. The form- 
mode relation may be a special case of a 
general principle which can be stated as fol- 
lows: The spatial coincidence of any two 
properties or aspects is a sufficient condition 
of unity. Actually it is not certain that the 
form-mode relation is one of coincidence. 
The following would provide a more de- 
cisive test of the principle stated above. Let 
one stimulus series be a set of colored 
forms, each on a white ground. That is to 
say, the area of each form is of a homo- 
geneous and distinctive hue. Let the mem- 
ory task be that of recalling (or matching ) 
the hue of each form. One can construct a 
parallel series of identical forms, with the 
ground of each in homogeneous and distinc- 
tive hue, while the inner areas of the forms 
are white. The first series is an instance of 
coincidence of color and form, in contrast to 
the latter. In terms of the principle stated 
above, the first stimulus series should be 
superior in matching form with hue." 


17 Experiments employing this procedure will be 
reported subsequently. 
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It would be an error to consider the prin- 
ciple of spatial coincidence an instance of a 
law of proximity. The latter (for which we 
have found no support under the conditions 
of this investigation) presumes a continuous 
relation between distance and a given psy- 
chological function. We have seen, though, 
that there develops a strong difference be- 
tween complete spatial coincidence and a 
condition that departs only slightly from it 
(compare Experiment 1 with Experiments 
11 and 12). 

We have not attempted to explain why 
the relation in question favored functional 
coherence. Given the restricted degree to 
which form and mode alter each other, the 
strength of the obtained effect is indeed im- 
pressive. The following remarks are in- 
tended only as suggestive. 

First, we note that the modes of this in- 
vestigation had two properties: they were 
forms, and they functioned as points in gen- 
erating a form. To us it seems deserving of 
investigation to establish whether a double 
function is a condition favoring coherence. 
At present, there is no evidence on this 
point. 

Second, it may be going too far to say 
that form and mode do not interact. Some 
modes may well be more compatible with a 
given form than other modes (see Figure 
8). Thus, a circular mode (Figure 8a) 
tends to blur an angular form such as the 
[p especially at the apex points, but is 
ess affected by angular modes (see Figure 


8b). At the other extreme, there is an out- 
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standing relation of fittingness between a 
form and a mode of the identical form (see 
Figure 8c). At the same time, modes with 
least individuality as forms in their own 
right, such as dots and continuous lines, 
least disturb the form property (see Figures 
ЗА and 8e), and are suitable for the deline- 
ation of all shapes. 

Although there is no advance relation of 
requiredness between form and mode, they 
may be more or less compatible once they 
are paired. Thus, compatibility may intro- 
duce a more intrinsic relation between form 
and mode. Also, the possibility arises that 
in the unitary condition the S is more 
directly centered on the relation between 
form and mode. For example, he may be 
sensitive to the fact that a curved form 1s 
carried by angular modes. This formula- 
tion does not explain away the obtained dif- 
ference between the unitary and nonunitary 
stimuli, since a difference of sensitivity of 
the kind here assumed would itself be a 
function of the property of unity. In addi- 
tion, it is doubtful whether such distinctions 
can adequately account for the order of the 
obtained effects. 

Finally, it is a question for separate in- 
vestigation whether the more usual category 
of part—-whole relation produces effects simi- 
lar to those under discussion. The interpre- 
tation put forward in this study, namely, 
that perceptual unity is a condition of co- 
herence in recall, requires an affirmative 
answer.!$ 

This discussion may serve to illustrate 
the fruitfulness for the present problem of 
an analysis of the stimulus. (а) The consti- 
tutive relation was distinguished from the 
part-whole relation and from the conditions 
of perceptual grouping hitherto studied. (b) 
We have shown that the constitutive rela 
tion is one of dependence, and suggest 
that dependence may be a property of UP, 
tary stimuli generally. (c) A principle o! 
spatial coincidence was stated as one condi 
tion of phenomenal unity. (4) The questio" 
was raised whether the presence of a dou © 
function favors coherence. (e) The relevant 
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18 Experiments dealing with this question will b 
reported subsequently. 
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of the part-whole relation to the present 
problem was described, and shown to fur- 
nish a test of the general interpretation 
growing out of this investigation. 


Perception and Association 


This investigation deals with the relation 
between perceptual properties and coherence 
as evident in the data of memory. It there- 
fore makes contact with the older problem 
of the relation of perception to association. 
The major theories of association have at 
this point followed different, indeed op- 
posed, assumptions. These we will now ex- 
amine, in order to see their relation to the 
present problem. 

1. What one may call associationistic doc- 
trines of association hold to a sharp separa- 
tion between perception and association. 
This is a direct consequence of the historical 
fact that these doctrines started with the as- 
sumption that psychological contents were 
sense data, which had the status of inde- 
pendent, irreducible elements. This starting 
point dictated the scope and conception of 
the associative process. In the first place, it 
was to association that опе had to turn for 
the explanation of order in mental life. 
Sense data, being discrete, lacked order; 
they had to be combined into stable se- 
quences, and this was the function of asso- 
ciation. There were thus two major prob- 
lems for this psychology: the prn 
and description of the sense data x ae 
ments (and eventually the study о = 
physical energies that are their pond 
Stimuli), and the investigation of t ^ e 
ner in which they become joined. The i3 Ч 
damental task of associationistic inquiry 
Was to account for the joining of one ae 
datum to another, the process that was 
responsible for the constitution of pati 
Ogical objects, and in general for the 
Ciples of order that unorganized sense data 
lacked. 

Given the sense data thesis, i 
Consequences followed for the interpretation 
Of association as a process. If the function 
Of association was to join data that are 
Mitially unconnected, then the noting of 
Sense data was one kind of event, the asso- 


s, a number of 


ciation of them another. On this basis an 
association was conceptualized as the for- 
mation of a bond or pathway between un- 
changing contents, and independent of the 
character of the contents. Some association- 
istic theorists, such as John Stuart Mill, 
were not content with this formulation, sus- 
pecting that association often produced a 
more intimate joining of contents. It was in 
this connection that the notions of “fusion” 
or "mental chemistry" were brought for- 
ward, but the problem was not to be faced 
until the appearance of the gestalt theory of 
perception. 

This sharp separation between the data 
and their association served as the theoret- 
ical basis for the experimental study of as- 
sociations. Let it be noted that, in terms of 
the present discussion, an association has 
been historically conceptualized as a process 
occurring between units; the sense data of 
associationism were discrete, sovereign 
units. The experimental movement left this 
assumption untouched. To be sure, the 
actual study of associations has rarely 
worked with elements ; typically, the proce- 
dure has been to establish associations be- 
tween complex contents. But the failure to 
question the traditional assumption had the 
following consequences. First, association 
was always studied between distinct units. 
Thus, the prevailing assumption perpetu- 
ated a uniformity of procedure and dis- 
couraged efforts to vary deliberately the re- 
lations between the terms that were brought 
into association. Second, it forestalled ques- 
tions about the properties of the stimuli that 
were studied. The study of associations has 
largely taken the stimulus for granted, con- 
centrating on the connections alone. The 
result was to maintain the sharp separation 
between perceptual and associative opera- 
tions. 

The subsequent advances in the field of 
perception, and in particular the study of 
unit segregation and formation, have not in- 
troduced any substantial modification into 
this traditional position. The investigator 
may assume that principles of perceptual 
organization determine the formation of 
units. But these perceptual events, he holds, 
serve only to set the stage for the subse- 
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quent, and wholly different, associative 
operations. Perceptual conditions may de- 
termine the identifiability and clarity of the 
stimulus, but association proper is an activ- 
ity following its own laws. This is in effect 
to deny a direct influence of perceptual re- 
lations on association, while acknowledging 
that perceptual conditions exert a limiting 
effect on associative operations. The latter, 
to be sure, must work on what is perceptu- 
ally given; perceptual principles decide what 
the units are, or what will be associated. But 
once these conditions are given, associative 
operations enter that are autonomous. Thus, 
the data of perception play a role in current 
associative thinking analogous to that of the 
elements in the earlier forms of association 
doctrine. 

The findings of this investigation, which 
establish a relation between perceptual 
Properties of the stimulus and functional 
coherence, ‘raise the following questions 
for the position of associationism. If the 
phenomena here studied are all instances 
of one process, it becomes necessary to re- 
vise current associative formulations at a 
decisive Point, and to assert that perceptual 
relations exert a DIRECT effect on the 
formation of associations. This conclusion 
would come into doubt if it were established 
that the coherence of terms within a unit is 
different in process from the coherence be- 
tween one unit and another, a question to be 
discussed in the following sections. In that 
case, it would be necessary to conclude that 
the traditional Study of association has 
dealt with a restricted question, that it has 
not encompassed the full range of func- 
tional connections, and that the stimulus 
conditions for association must be specified 
in perceptual terms, 

2. The gestalt studies of perception of the 
last 50 years have been the basis of a radi- 
cally different account of the relation of 
perceptual to associative events, It reverses 
completely the earlier emphasis, Proposing 
to replace an associative theory of percep- 
tion with a perceptual theory of associa- 
tions. This is the intent of the theory for- 
mulated by Kohler (1929, 1941). That per- 
cepts are organized signifies that the parts 
are in interaction, or that they mutually al- 
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ter each other. Now the material of recall 
preserves, often quite correctly, the organ- 
ization of earlier experience. It follows that 
memory traces, or the physiological bases of 
memory, are the aftereffects of preceding 
organization. This is to say that the terms 
associated no longer retain their independ- 
ence; they are parts of a unified organiza- 
tion. In this formulation, an association is 
the persistence of a preceding organization, 
not an event added to the terms brought into 
association; in short, an association is fun- 
damentally different from a mere connection 
between terms that remain independent, It 
also follows that conditions favoring organ- 
ization in experience favor trace organiza- 
tion, and therefore recall? 

The results of this investigation provide, 
we propose, the most direct evidence for 
the relation of perceptual organization to 
coherence in memory. In this sense they 
furnish confirmation for the theory of 
Kohler. At the same time, neither of the 
principal theories of association has dealt 
directly with the problem of this study, 
namely, with the distinction between co- 
herence within a unit and the coherence of 
one unit with another. It remains to be 
seen what the relation of the present prob- 
lem and findings is to these positions. 


A Further Distinction between Unitary and 
Nonunitary Coherence? 


We have hitherto concentrated on the | 
quantitative differences in memory between 


two kinds of stimulus conditions. We will 
now explore the thought that the difference 
in question is not adequately described in 
quantitative terms alone, that the two con- 
ditions are separated in а more systematic 
and far-reaching way. Finally, we will con 
sider whether one process of coherence will 
suffice to account for both sets of data. 

The problem can best be introduced bY 
referring to a known fact concerning asso" 
ciative recall, but one that deserves to be 


19 The theory of Köhler concerns the effects of 
organization generally, and is not restricted to ре!“ 
ceptual organization. The general formulations are 
of course, intended to include perceptual organiza 
tion. 
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better-known. The recall of an association, 
more particularly the recall of b when a is 
given, may fail for two fundamentally dif- 
ferent reasons. 1. Recall will fail, at the 
time of test, if there is failure to recognize 
the given term a. This assertion follows 
from an assumption we consider compell- 
ing, namely, that associative recall presup- 
poses a prior step of recognition (Kohler, 
1940). 2. Recall of an association may not 
occur although the starting term a is recog- 
nized; in that case there is a failure in the 
recall of an association. j 
We now propose to examine 
that failure to complete a unitary stimulus 
has its source exclusively in the first of the 
two alternatives described above, but failure 
to complete a nonunitary pair, which often 
occurs despite recognition of the first mem- 
ber, is due to failure of associative recall. 
This may be restated positively as follows. 
When part of a unitary stimulus is recog- 
nized, the missing part will be invariably 
completed; in contrast, one of the members 
of a nonunitary pair may be recognized 
without producing completion of the other 
member. This distinction, if empirically 
established, might be a criterion of impor- 
tant differences in the underlying processes. 
Before proceeding, let us note the condi- 
tions that have to be met in order to decide 
whether failure of recall is due to failure of 
recognition or of the recall of an associa- 
tion. First, it must be established that the 
association in question was formed. Second, 
it is necessary to determine whether recog- 
nition of the terms occurs at the time of test. 
Despite the obviousness of these require- 
ments, there is very little concrete knowl- 
edge at present of the contribution of these 
factors. The investigation of memory has 
almost totally overlooked the distinction be- 
tween these sources of failure of recall, a 
consequence of the neglect of the funda- 
mental difference in process between recog- 


nition and recall. 
One consequence 
hypothesis for the dat; 
Recall of a unitary st" 
plete; that is, recall К 
Should not occur one Wit 
(Implicit is the requirement 


the thesis 


e follows from the stated 
data of this investigation. 
mulus should be com- 
of form and mode 
hout the other. 
that failure to 


recall a unitary stimulus should also be 
complete.) Since the present investigation 
was not initially designed as a test of the 
hypothesis, the following re-examination of 
the data will be somewhat indirect and only 
suggestive. 

1. One source of evidence comes from a 
re-examination of the matching data of Ex- 
periment 8. 

Experiment 8 included, in addition to 
matching, a test of recognition of form. We 
now selected those forms in the unitary 
series that were correctly recognized, and 
computed for each S the proportion of cases 
in which these forms were also correctly 
matched. The mean of these proportions 
was 63.8; that is, on the average 6466 of 
recognized forms were correctly matched. 
It may seem that this result is negative for 
the hypothesis in question, but this would 
be an incorrect inference. Аз we have seen, 
the criterion put forward requires that the 
unitary stimulus should have been ade- 
quately perceived at the time of learning. 
If, for example, the 5 initially perceived 
the form but for some reason neglected the 
mode, subsequent partial recall would have 
no bearing on the hypothesis. We lack, 
however, direct information concerning the 
perception of modes in this condition. For- 
tunately, we can obtain an approximation to 
such a measure from Experiment 6, in 
which the conditions of learning were iden- 
tical with those of Experiment 8, but the 
test was one of recognition of the forms 
and modes, respectively. The results appear 
in Table 18. 

Proceeding as before, we selected all in- 
stances in Experiment 6 of correct recogni- 


TABLE 18 
RELATION or RECOGNITION OF FORM TO PAIRING 
(in %) 
Experi- Experi- Experi- 
ment 6: ment 8: ment 9: 


"Synthetic | Matching: | Matching: 


Pairing" |Intentional| Incidental 
Unitary 70.4 63.8 45.1 
Nonunitary 84.0 31.6 19.2 
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tion of form in the unitary series and com- 
puted for each S the proportion of instances 
in which the paired modes were also recog- 
nized. The resulting values give us a meas- 
ure of “synthetic pairing" of recognized 
forms. The significance of this measure is 
that it states the limit of actual pairings that 
can occur under the given experimental con- 
ditions. Actual pairing cannot exceed the 
level of synthetic pairing, which is an index 
of the joint perceptibility of forms and 
modes that belong to the same stimulus; the 
former can only be of lower frequency than 
the latter. 

The mean proportion of synthetic pair- 
ings in Experiment 6 was 70.4; that is to 
say, on the average 70% of the recognized 
forms were accompanied by the recognition 
of their modes. The difference between this 
value and 63.8%, the value obtained in Ex- 
periment 8, is small and not significant (р 
> .10). 

The preceding analysis assumes that the 
recognition data of Experiments 8 and 6 
can be compared despite the differences of 
conditions at the time of test. (In Experi- 
ment 6 new items were interspersed with 
the old for the test of recognition; not so 
in Experiment 8. Further, the recognition 
test of Experiment 8 followed a test of 
matching.) Despite these differences, the 
levels of recognition of form were remark- 
ably similar in the two experiments. There- 
fore, the comparison seems justified as well 
as the assumption that the two variations 
produced similar levels of recognition of 
modes, 

м a се Up account the percepti- 
completeness = up At He z 
stimulus is in accordance with th ya 
e criterion. 
Nevertheless, a more direct test will be 
necessary for a definitive conclusion. 

2. The data of Experiment 10 are also 
pertinent. Here we employed a procedure 
of familiarization preceding the main task 
which insured the full identifiability of the 
stimuli at the time of learning. We also 
tested for recognition of the original stim- 
ulus series. For the purpose of testing the 
present hypothesis, this procedure may be 


the most sensitive. Our concern is to estab- 
lish whether a given unit has been retained 
as a unit ; given this aim, it is desirable that 
the test reduce the burden of recall proper 
to a minimum. 

The interpretation of the results has to 
take into account the following detail: the 
test of recognition was itself a source of 
interference. We found that the position of 
the stimuli in the test series affected accu- 
racy of recognition. Accordingly, we exam- 
ined the level of recognition of stimuli that 
came first in the test series (these stimuli 
all appeared toward the middle of the learn- 
ing series, in the fourth position, a position 
susceptible to interference). The level of 
accurate recognition of the unitary stimuli 
was 96% (in contrast to the level of 52% 
for the nonunitary stimuli). This result 15 
based on one half of the stimuli in the uni- 
tary series. 

The pertinence of this result derives from 
the following consideration. The procedure 
of familiarization made certain that there 
would be complete recognition of the parts 
of the unitary stimulus. Under these condi- 
tions we find that recognition of the belong- 
ingness of the parts is at a maximum, which 
is in accordance with the hypothesis. 


3. Turning to the data of free recall, the 
hypothesis requires completeness of recall 
of the unitary stimulus. We have found 
that such inseparability was by no means al- 
ways the case: isolated recalls and incorrect 
pairings did occur with measurable fre 
quency. Before drawing a negative conclu 
sion, the following points need to be coU 
sidered. (a) The criterion in question pre 
supposes, as we saw, that the stimulus W2 
adequately perceived at the time of direct 
experience. We know that this was not th® 
case throughout, if we may judge by the 
recognition data (see Experiment 6). ( ) 
Further, it was quite possible to register 2 
stimulus simultaneously in a unitary manner 
and discretely. The modes in particula" 
were often highly familiar; the Ss We? 
capable of perceiving them as parts of the 
stimulus and at the same time of recordi” 
them separately. Such multiple registrati?! 
could also account for isolated recall. 
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Finally, uncontrolled guessing could account 
for certain incorrect pairings. It should be 
noted in this connection that incorrect pair- 
ings occurred infrequently (see especially 
Experiments 1, 4, and 5 which are most 
pertinent). The free recall data are thus 
inconclusive. Failure of recognition, sepa- 
rate registration, and guessing limit their 
value for the present purpose. . 

Although the results of the preceding 
analysis are not definitive, they offer some 
support for the hypothesis when the neces- 
sary conditions are most adequately met. 
But further investigation, directly oriented 
to the hypothesis, is clearly needed. Such 
investigation will require the simultaneous 
determination of data not ordinarily ob- 
tained in experimentation: it will be neces- 
sary to establish that learning had occurred, 
and to relate, at the time of test, the data 
of recall to those of recognition. 

Before concluding, the following remark 
may be in order concerning the criterion of 
invariable completion. It would be going 
too far to assert that the trace of a unitary 
percept cannot be disrupted under any con- 
ditions, that it must always persist—or dis 
appear—as a unit. Our intention is to make 
a less sweeping assertion, {0 suggest that a 
unitary trace will persist as a unit under 
“normal” conditions, as long as no extreme 
measures are taken to disturb it. Admit- 
tedly, what is to be regarded as normal is 
left undefined at this point. Despite this 
shortcoming, and despite the uncertainties 
surrounding the hypothesis, the criterion 
directs us toward a valid problem. It asks 
us to consider what parts, or aspects, ofa 
stimulus distribution are inseparable, or— 
since the criterion is a severe one—nearly 
inseparable. It can hardly be said, in the 
light of present knowledge, that investiga- 
tion will fail to discover instances that meet 
the criterion. 


ASSOCIATIVE AND NON ASSOCIATIVE 
PROCESSES 


The evidence for two substantially differ- 
ent kinds of stimulus conditions and effects 
prompts us to ask whether the respective 
operations are similar or distinct. Gen- 


erally, in psychology, we take the joint re- 
call of “two” data as evidence of associa- 
tion. Is it equally clear that the coherence 
of terms belonging to the same unit is an 
instance of association? There is the possi- 
bility that one uniform process is responsi- 
ble generally for functional coherence, 
whether it occurs within a perceptual unit 
or between units. If so, it would follow that 
the conditions we have called unitary pro- 
vided superior conditions for association. 
But we need consider also the alternative 
that there are distinct kinds of coherence, 
and that these are a function of the particu- 
lar unit relations. 

This question has not attracted attention 
mainly because the study of units has been 
the province of perceptual investigation, 
while associative studies have limited them- 
selves to the combination of units. It has 
become evident, though, in the light of this 
investigation, that the properties of units 
are pertinent to the general problems of as- 
sociation. Further contributing to the pre- 
vailing assumption are the procedures cur- 
rently employed in investigation, which are 
best designed to reveal quantitative differ- 
ences between effects, thus creating the pre- 
sumption of a uniform process. To speak 
of association is harmless as long as the 
term refers to a uniform set of procedures 
and effects. But as soon as it refers to dif- 
ferent conditions and effects, it becomes 
charged with theoretical implications, and 
ceases to be merely descriptive. Surely, it 
would not be right to conclude that the uni- 
tary condition is associative simply because 
the procedure employed in studying it was 
that of the association experiment. 

What grounds may there be for the hy- 
pothesis of a nonassociativ e kind of coher- 
ence? We would first mention an observa- 
tion that is only suggestive. The typical 
associative situation (as represented in the 
nonunitary condition) begins with the ap- 
prehension of unrelated data that are subse- 
quently joined; perhaps it is this transition 
from an initial to a later state that charac- 
terizes the step of association. The situation 
is importantly different when the data are 
from the start apprehended as members of 
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one unit. In the latter case the relation be- 
tween the terms evolves directly as a percep- 
tual process, lacking the transition from an 
initial to a terminal state. We are mindful 
that this observation is not conclusive; the 
difference in question could well be func- 
tionally continuous. 

The following consideration seems to us 
the most compelling: the unitary condition 
produced an effect that no other pairing 
condition achieves. We know of two sub- 
stantially different association experiments : 
that which pairs heterogeneous unit terms, 
and the pairing of units that stand in clear 
relations of fitting, of which the investiga- 
tion of Prentice and Asch (1958) is an ex- 
ample. The unitary condition differs from 
the former of these in ways already de- 
scribed. It is now necessary to call attention 
to the critical difference between the unitary 
condition and that produced by related 
units. In the latter case coherence was also 
ata high level, but it was a consequence of 
intrinsic perceptual relations. The unitary 
condition produced comparable effects when 
the terms in question were heterogeneous. 
We know of no stimulus relation other than 
that internal to a unit that brings about this 
effect. 

We need now consider whether these 
effects can be referred to a single process, 
Since such a solution would introduce a most 
desirable orderliness in our thinking. In 
doing so let us refer to the three conditions 
of coherence that have been identified: (a) 
the pairing of distinct and heterogeneous 
units, as in the classical association experi- 
ment; (6) the pairing of units related by 
properties such as symmetry, completion, 
etc. (Prentice & Asch, 1958) ; and (c) the 
"pairing" of heterogeneous constituents of 
units. The three cases differ in fundamental 
respects: (a) and (c) are alike in hetero- 
geneity but not in coherence; (5) and (c) 
are comparable in coherence but not in 
heterogeneity; while (a) and (b) differ in 
both respects. To assert that they are func- 
tionally continuous, we believe, requires the 
following arguments. 

One would have first to reverse an earlier 
assumption and maintain that all of the pre- 

ceding conditions are unitary, and that they 


differ solely in degree of unity. More par- 
ticularly, this interpretation asserts that any 
pairing, however heterogeneous the terms, 
forms a unit; that coherence is a function of 
degree of unity or interdependence among 
the members of a stimulus distribution ; and 
that the empirical discontinuities are a con- 
sequence of an underlying functional con- 
tinuity. This is the position that Köhler 
(1929) has taken in a classic discussion, in 
which he speaks of association as “coherence 
within the unitary trace of an equally uni- 
tary or organized experience. = wp: 
282) 20 

To refer all functional coherence to or- 
ganizational conditions, we believe, is to go 
in a necessary direction. The formation of 
the most recalcitrant association is evidence 
of interaction between the terms in question, 
and to this extent might also properly be 
called an instance of unit formation. We 
are not convinced, however, that this formu- 
lation, which was directed to the typical 
association experiments, fully takes into ac- 
count the problem which the present investi- 
gation poses. The latter requires us to ask 
whether a unitary relation that is formed 
spontaneously is functionally identical with 
the unity established between heterogeneous 
terms. It is well-known that the properties 
and effects of interaction can differ consid- 
erably; indeed, interaction may also result 
in segregation. (To judge by the phenom- 
enal evidence, this describes one aspect of 
the figure-ground relation of Experiment 
11.) Thus, the issue is substantial un- 
changed when one takes into account the 
organizational character of rote associa- 
tions. 

The following would be a more direct 
way of deciding between the alternatives. Tf 
the stimulus conditions we have contrasted 
are functionally continuous, this could be 
demonstrated by finding intermediate stim- 
ulus distributions and observing whether 


20 At another point Köhler (1941) has ава" 
asserted that "the fact that a unitary process forms 
a unitary trace is equivalent to what we call e 
association. From the present point of view associa” 
tion is therefore simply coherence within the unitary 
trace of a unitary experience” (p. 493). 
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the results of coherence are correspondingly 


intermediate. Such investigation would be 
timely. We see the possibility of system- 
atically varying for this purpose the degree 
of fittingness (and of other relations) be- 
tween paired units. At this time, though, 
we know of no way to bridge the unitary 
relation we have here studied with the сог- 
responding dual relation. 

If there are distinct kinds of coherence, 
the resulting memorial processes must be 
correspondingly different. We will now at- 
tempt to derive this difference from the dis- 
tinction, examined in the preceding section, 
between recall that invariably follows recog- 
nition and recall that may fail despite recog- 
tion. For the purpose of discussion we will 
assume what is still to be proven, namely, 
that there are specifiable stimulus conditions 
in which recall follows invariably upon 
recognition. 

The particular statements to be examined 
are as follows. 

1. Completion in memory that invariably 
follows recognition is an instance, not ot re- 
call, as is generally assumed, but of recogni- 
tion. But completion in memory that may 
or may not follow upon recognition requires 
a distinct step of recall. | 

2. The first kind of completion is поп- 
associative, the second is an instance of as- 
sociative recall. This formulation states an 
independent, although still quite. е: 
property of an association : the criterion о 
an association is that recall of it can fail fol- 
lowing upon recognition of the preceding 
term with which it was paired. This point 


deserves notice in the light of the observa- 
tion that the occurrence of an association 
in 


has hitherto not been defined except à 
terms of the employment of a particular ex- 
perimental procedure. | 

3. It follows that completion in memory 
of part of a unitary stimulus is an instance 
9f recognition, which is nonassociative, but 
completion between heterogeneous units is à 
Case of associative recall. 

For illustration we compar 
memory results: | | А 

la. Recognition of a unit lacking natura 
Subparts. 


e the following 


1b. Completion of part of a unit lacking 
articulated subparts. 

2. Completion of a unit with articulated 
subparts. 

3. Completion of 
pairs. 

la. An observer inspects a black elliptical 
contour line on a white ground. At the time 
of test the same figure appears with a num- 
ber of others; let us suppose that it is recog- 
nized, The occurrence of recognition we in- 
terpret in accordance with Kóhler's (1940) 
account of the Hóffding function. The per- 
ception of the ellipse produces as an after- 
effect a trace corresponding in organization 
to that of the percept. On its later appear- 
ance, the elliptical stimulus produces a new 
percept corresponding to current stimula- 
tion. The occurrence of recognition signi- 
fies that the latter process makes contact 
with the trace of the earlier percept, and 
that the contact is made possible by the 
similarity between them. 

1b. The stimulus is as above—an elliptical 
contour line. At the time of test the ob- 
server is shown a portion of the ellipse, with 
instructions to reproduce the earlier stim- 
ulus. For completion to occur, the part- 
stimulus must be recognized as similar to 
the one that was seen earlier. That is to 
say, the present perceptual process, on the 
basis of similarity, must arouse the trace of 
the earlier percept. This situation differs in 
an obvious respect from the one just de- 
scribed earlier. The stimulus is now reduced 
at the point of test; the trace is contacted by 
a process that corresponds to it only par- 
tially. But the trace is of a unitary form 
without parts; the ellipse possesses а whole- 
quality that is given directly as a perceptual 
datum. (A sophisticated observer may of 
course apprehend the shape in a more ana- 
lytical way, but we will assume that this is 
not the rule in immediate perception.) If, 
as in this case, the trace of the earlier or- 
ganization is highly unified, it may not be 
possible to reactivate it without doing so in 
its entirety. Once trace contact of the kind 
described has occurred, the entire trace of 
the old stimulus will be activated, a direct 


consequence of its unity. 


heterogeneous unit 
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well-known and strong condition of coher- 
ence. This effect of meaning has been vari- 
ously interpreted. Often, it has been re- 
ferred to familiarity produced by past ex- 
perience. While this is often the case, one 
may doubt the adequacy of this interpreta- 
tion in all cases. A relation of meaning is 
also an instance of conceptual unit forma- 
tion; the advantage that meaning introduces 
is doubtless a consequence of this fact. The 
question one might raise is whether the con- 
tribution of meaning in the setting of the 
association experiment is at all unique, and 
whether the advantage it provides is not 
completely a function of the relation of 
unity it establishes. 


SUMMARY 


The traditional study of associations is 
part of a more inclusive problem, one that 
concerns the relation between the objective 
structure of the stimulus and the coherence 
of its parts, as evident in perception and 
memóry. The classical association experi- 
ment has restricted observation to a limited 
phase of this problem. It has concentrated 
on the coherence established between pairs 
of heterogeneous units. Theory and investi- 
gation in this area have not considered the 
formally analogous operations within well- 
structured units. This study was concerned 
with the effect of the perceptual relation of 
unity between the parts of a stimulus distri- 
bution on the capacity to retain them jointly. 

Employing the technique of the classical 
association experiment, and visual forms as 
stimuli, the perceptual relations between the 
parts of a stimulus distribution were sys- 
tematically varied. The following conditions 
were studied : the relation of constitution or 
composition, the relation of figure and 
ground, the relation of inclusion. and the 
relation of contiguity between | 


pairs of 
heterogeneous units, 


The contents or terms 


composing the several relations were either 
identical or closely similar. The relations in 
question were phenomenally describable as 
either unitary or nonunitary. 

Coherence of recall, or the ability to recall 
jointly the parts of a stimulus distribution, 
was substantially superior when the data be- 
longed to the same perceptual unit. Errors 
in recalling “what went with what" were 
regularly more frequent when the parts of a 
stimulus distribution belonged to perceptu- 
ally distinct units. These differences per- 
sisted with different tests of recall, and also 
when the test was that of recognition, thus 
establishing that the effect occurred at the 
point of direct experience. 

The results demonstrate a direct relation 
between the unit properties of the stimulus 
and coherence. Given a difference of the 
unit relation between two stimulus distribu- 
tions, the direction of difference between 
them in coherence of recall can be predicted. 

The question was considered whether co- 
herence within and between units may differ 
in process. The hypothesis was examined 
that the spread of excitation from one uni- 
tary trace to another requires a step not 
present in the spread of excitation within a 
trace. A test of the hypothesis was pro- 
posed, which rests on the differential role 
of recognition in the unitary and non- 
unitary instances. A re-examination of the 
empirical data offered a degree of support 
for the hypothesis, but a definitive conclu- 
sion awaits further investigation that meets 
strictly the experimental conditions required 
by the tests. 


The consequences of the preceding hy- 
pothesis for a theory of associations were 
examined, as was also its relation to current 
theories of associations. 

The relation of this study to a range of 
questions in general psychology was dis- 
cussed. 


— С> 
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APPENDIX 


SEPARABILITY OF FORM AND MODE 


Units differ in degree of coherence. There is also 
reason to believe that units differ with respect to 
the range of conditions under which they will re- 
main coherent. Indeed, resistance to disturbing 
conditions may be an index of degree of unity. The 
following experiments were an attempt to explore 
the unity of the form-mode relation by means of а 
new procedure. 

In the preceding investigation the Ss were uni- 
formly directed to note both the form and mode of 
each stimulus. We now undertook, by means of 
altered instructions, to see to what extent it is 
possible to direct the S exclusively to the form or 
mode alone of the unitary stimulus. The degree to 
which this separation can be achieved may serve 
as an index of lack of unity. In short, we take 
separability of the properties of a unitary stimulus 
as a criterion of unity. 


Experiment 15: — Separability of Form 
from Mode I" 

The unitary stimuli of Experiment 5 were the 
materials of this experiment. The procedure was 
that of Experiment 5, except for the following 
modification. Тре Ss were instructed that the task 
was to recall only the forms. The instructions 
stated in part: "Your task is to recall the larger 
shape alone. You may be interested to know that 
We are comparing the recall of the larger forms 
under this condition with one in which the contour 
line is always drawn continuously (an illustrative 


card with the same form in continuous contour 
and with another modal contour was here shown). 
It just so happens that you have been assigned to 
the group with different contour lines. After I 
have shown you the cards once, I will ask you to 
draw as many of the forms as you can remember, 
using a continuous line, such as this (pointing to 
the illustrative continuous sample)." This explana- 
tion provided an effective rationale. 

The test, which came after 3 min., was one of 
free recall. At this point, however, the Ss were 
instructed to reproduce, whenever possible, the 
modes as well as the forms as they were initially 
given. j 

The design of this experiment thus combines an 
intentional with an incidental procedure. It directs 
the Ss to focus on one aspect of the stimulus—the 
form—and to disregard the other; at the same 
time the perception of the one must carry with it 
the registration of the other. In this respect the 
technique is similar to that of the Külpe abstraction 
experiment. 


Experiment 16: Separabiliy of Mode 
from Form 


The stimuli and the procedure were identical 
with those above, except that the 5$ were in- 
structed to recall the modes only. Thus, this con- 
dition was incidental for the form, and intentional 
for the modes. 

The results of both conditions appear in Table 19. 


TABLE 19 


RECALL SCORES 
Experiments 15 and 16 


2 3 4 
N +(FM) | -(FM) F M M 
Experiment 15 12 M 2 1.2 3.8 0.1 8.3 
%, 38.4 14.1 46.5 1.0 
Experiment 16 17 
a. Totally obedient 55 5 M 8 1.2 0.2 5.2 9.4 
% | 29.8 12.8 2.1 55.3 
b. Totally disobedient Ss 3 M 1 1.3 0.7 1.0 aT 
% | 68.9 13.8 6.9 10.3 
с. Intermediately obedient Ss| 9 м 2 0.7 0.8 2.9 10.6 
% | 60.2 6.5 8.4 27.9 


Le 
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1, We will first examine the results of Experi- 
ment 15. 

Perhaps the most noteworthy finding is that the 
Ss were capable of complying with the instructions. 
Of a total of 14 Ss, 12 were completely obedient 
to the instructions, judged by their answers to 
questions, at the conclusion of the experiment, con- 
cerning the procedure they adopted during learning. 

The quantitative results, which appear in Table 
19, provide convincing evidence of the effectiveness 
of the instructions. Most pertinent is the com- 
parison of form and mode recall. The mean recall 
of forms was 6.0; of modes, 2.3. Each S recalled 
more forms than modes. Equally telling are the 
data for isolated recalls: of 47 isolated recalls, 46 
were of forms. в 

As further evidence of the effect of this condi- 
tion, we may cite the following. At the conclusion 
of the experiment the Ss were shown the modes, 
cach drawn as a straight line, and tested for recog- 
nition. (The decision to proceed in this way was 
made when the experiment was already in progress, 
and was applied to only 8 5$.) Although this was 
not an adequately controlled test of recognition, the 
results are instructive. The mean number of modes 
recognized was 5.3, a level considerably below that 
obtained in Experiment 6. Individual recognition 
scores ranged from 2 to 8. M 

In the light of these findings, it is not surprising 
that the level of joint recall was low. In absolute 
terms the level was one fourth of that in the U 
condition of Experiment 5, and less than one half 
of that in the non-U condition. Correct pairings 
were also depressed proportionately, when com- 
pared with the level of the non-U series of Experi- 
ment 5. 

2. The majority of Ss of Experiment 16 found 
it far more difficult to obey the instructions. They 
reported that the form remained dominant, and that 
they involuntarily referred to it when attempting to 


` commit the modes to memory. Compliance with the 


instructions was again judged on the basis of the 
Ss’ replies to questions put at the conclusion of the 
experiment concerning their procedure during 
learning, and without knowledge of their recall 
performance. Out of a total of 17 Ss, only 5 suc- 
ceeded m obeying the instructions; 3 were com- 
pletely disobedient; and 9 were judged to be inter- 
mediate. 

In Table 19 we categorized the recall data in ac- 
cordance with this classification. Despite the small 
numbers of cases, the results fall into a clear 
Pattern. 

a. Fully obedient Ss performed in a manner that 
|v corresponds to the performance of 
periment 15. There was a strik- 
{ mode over form recall, the 
respective means being 7.2 and 22. Out of 27 iso- 
lated recalls, 26 were recalls of mode. All 5 Ss 
showed the same superiority of form over mode 
recall, 


completely 
nearly all 5s in Ex 
ing preponderance O 


Correct pairings were markedly depressed; there 
were only 7 such instances. Even this figure is in- 
flated, since all cases of joint recall were of the 
first two stimuli in the learning series, that is, of 
those appearing during the phase of least crowding. 

Thus, separability of form and mode was 
achieved in this, as in the preceding experiment, 
although by fewer Ss. 

b. The three totally disobedient 5$ differed 
strikingly from the preceding group. Recall of 
forms and modes was now identical. Most note- 
worthy was the drop of isolated recalls. In all 
respects these Ss resembled closely the U Ss of 
Experiment 5; this was so especially with respect 
to the frequencies of correct and incorrect pairings. 
Further, correct pairings were no longer confined 
to the early stimuli in the series. 

c. The intermediate Ss fell between the two pre- 
ceding groups. There was a preponderance of mode 
recall; the latter accounted for the 61.3% of all re- 
calls. Pairing was close to the U variation of 
Experiment 5. 

We tested all Ss for recognition of form. These 
were shown at the conclusion of the free recall 
test, drawn in continuous contour. The mean recog- 
nition score was 5.9, again considerably below the 
level obtained in Experiment 6. Individual recogni- 
tion scores ranged between 2 and 10. 

Complete separability was not achieved in either 
experiment. But a substantial degree of separa- 
bility occurred under both sets of instructions?! 
Also, it was easier to focus on the form than on 
the mode, a result in accordance with earlier evi- 
dence that recall of form predominated in the 
unitary condition (see pp. 8, 11). 

A full interpretation of these findings must await 
further investigation of the relative ease of sepa- 
ration of unitary stimuli of other kinds. It 
would be of particular interest to identify those 
part-properties of stimuli that cannot be divorced 
from each other in perception and recall. 

Under the conditions here described, one prop- 
erty of a stimulus was made focal and another 
peripheral; at the same time the perceptual situa- 
tion was such that the ignored property could not 
but be registered. It was hardly possible to inspect 
the form without registering the modes as a visual 
datum, and conversely. That the ignored property 


?1 The procedure of testing for recall probably 
underestimates the actual level of coherence. This 
in no way weakens the weight of the comparison 
between the present data and those obtained under 
the fully intentional conditions of Experiment 5, 
which also employed the test of free recall. 

?? We have not done corresponding separability 
experiments with the nonunitary series, Such in- 
vestigation might be appropriate as part of a sys- 
tematic inquiry, but the results obtained earlier with 
the nonunitary series justify the surmise that such 
a procedure would probably be extremely effective. 
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suffered in recall, and even more important, т 
recognition, is evidence of the importance of atten- 
tion, as distinct from sheer stimulation, in learning. 

The following caution must be noted in inter- 
preting the preceding results. Separation of form 
and mode was achieved only as a consequence of a 
special attitude induced by instructions to the S. It 
would be unwarranted, and probably misleading, to 
infer that form and mode were equally separable 
under the conditions of the main experiments of 
this investigation, and that this separability was 
counteracted by an equallv special attitude in the 
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opposed direction. It is more likely that we have 
here tested separability under extreme conditions, 
and that neutral conditions would demonstrate @ 
considerable coherence of form and mode. These 
comments make clear the need for study of the 
coherence of the form-mode relation, as of other 
units, under "neutral" conditions, such as will 
avoid directing the S either to the entire stimulus 
distribution or to any aspect of it, Such a proc 

dure should throw light on the modes of аррг 

hension of stimulus relations under relatively spon” . 
taneous conditions. 
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т is taken for granted that meaningful 

material is learned more easily than 
nonmeaningful material McGeoch and 
Irion (1952) write that "a high positive 
correlation between meaningfulness of ma- 
terial and rate of learning holds under a 
very wide range of conditions" (p. 479). 
Thus, when subjects are given the task of 
learning a list of meaningful items (e.g., 
adjectives) and a list of nonsense syllables 
of identical length, the adjective list is 
learned to criterion in fewer trials. Also, 
structured sequences of words (e.g. sen- 
tences) are easier to learn than lists of 
randomly selected words. 

Demonstrations of this nature together 
with other related experimental findings 
have led to the general acceptance of the 
facilitating role of meaningfulness. They 
have not, however, produced any unanimity 
in accounting for this effect of meaning. 
Five principal explanations which have been 
advanced are: 

l. Associationist explanation. Meaning- 
fulness as а determinant of learning gains 
its status as a consequence of positive trans- 
fer. Meaningful material allows the learner 
to draw on a vast store of previously estab- 
lished associations. As a result, the present 
process of learning has only to strengthen 
Some particular bond between the items 
Which was already in existence. Pairs of 
_ Nonsense syllables, on the other hand, do 
not enjoy the advantages of previously 
FEL ducted when the first 
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formed associations; since they require the 
formation of new associational bonds, they 
are more difficult to learn. 

Recently, this point of view has been 
restated in a somewhat modified way. For 
example, Mandler and Huttenlocher (1956) 
state that the learning of new associations is 
mediated by previously learned associa- 
tions to the stimulus- and response-terms" 
(p. 424). Therefore, "the more such asso- 
ciations are elicited by the stimulus- and 
response-members in a  paired-associate 
task, the greater will be the probability that 
a particular combination of a stimulus-asso- 
ciation and a response-association can be 
found to provide a mediating bridge or a 
‘meaningful’ link between the two" (р. 
424). Mandler and Huttenlocher asked 
subjects to associate to each of a number of 
nonsense syllables for an interval of 30 
seconds. The syllables were then ranked 
according to associative frequency (1.е., the 
number of associations evoked). Subjects 
were then given the task of learning a list 
of eight pairs of syllables, each pair having 
а different level of associative frequency. 
А significant correlation was obtained, "be- 
tween speed of acquisition of a nonsense- 
syllable pair and the mean associative fre- 
quency previously established for its two 
components" (p. 428). 

Presumably, the same reasoning could be 
applied to explain the facilitative effect of 
meaning. A meaningful word would evoke 
a greater number of associations and, there- 
fore, pairs of such words would be more 
likely to have "mediating bridges." 

2. Organizational explanation. А second 
point of view, represented by  Kóhler 
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(1929), contests the associationist explana- 
tion “mainly because the same words have 
occurred thousands of times in other much 
more regular connections, which ought ab- 
solutely to inhibit the assumed associations 
in this particular case” (p. 287). This in- 
hibition is to be expected particularly where 
one takes pains to pair words which would 
not have been experienced together often, 
if at all, in the past. Kóhler argues that the 
emphasis on association is misplaced and 
that the true explanation must be made in 
terms of the possibilities for organizing the 
learning material. That is, if while reading 
the pair of nouns, pen—girl, the learner 
visualizes a girl writing with a pen, then the 
learner will have “experienced some well 
organized . . . whole,? and it may be that 
here learning is so easy because organizing 
is so much easier in this case than in that 
of indifferent nonsense syllables” (Kohler, 
1929, pp. 287-288). If we determine which 
material is more amenable to organization 
and in turn determine where association is 
Strongest, we shall find them coincident. 
Thus, “where Organization is naturally 
strong we have Spontaneous association; 
Where there is practically no organization 
association does not occur until some or- 
ganization is created intentionally" (Kohler, 
1929, р. 290). Köhler points out that if 
the noun pairs, lake— sugar, pencil—gaso- 
line, etc. are presented to a group of sub- 
Jects, they will be learned more easily than 
an equal number of nonsense syllable pairs. 
Certainly, Köhler argues, no one can say 
that between such categorically disparate 
Items as pencil and gasoline there exist old 
associations which merely require some de- 
gree of Tepetition or exercise for success 
in the learning task, Tf learning is facili- 
tated under these conditions it can be only 
as 2 result of organization, It should be 
noted that Kéhler’s arguments do not apply 
to the more recent versions of the associa- 
tionistic View, eg... Mandler and Hutten- 
locher. 
which в ima (o RT e approach, 

inson’s (1932) “law 


2 We shall refer to 


d Such organized whol 
"conceptual units." oles. as 


of acquaintance” would view the theories 
described above as superfluous. The crucial 
factor is the familiarity of the individu 
items used in the learning task. Meaning 
material is also familiar; therefore, it woul 
be easier to master a list of pairs of meat 
ingful items than nonsense items. When 
adjectives are used the learner does 1° 
have to be concerned with mastering t 
individual items; these are already famili 
and thus easily distinguishable. Не < 
concentrate mainly on forming an asso 
tion between the members of a pair. Where 
nonsense material is used, the learner must 
first learn the individual unfamiliar ite™ 
and, in addition, form an association bg 
tween the items. The additional task w0 
make nonsense material harder to lear”: 


There is some experimental ее 
bearing on this argument. Waters (19 e 
reported that prefamiliarizing nonse 
Syllables did not facilitate learning. 0 
demonstrated that varying amounts of р 
reading and reciting of syllables did т 
significantly facilitate learning them WU 
new order. In his experiment three СОЙ 
tions were compared : d 

Condition I. The subjects first b. 

aloud the list to be learned three E 

They then proceeded to the formal l€ 

ing of the list. The serial order for le? p 

ing was the same as for initial read! 

Condition II. 'The same as Condition, » 
except that the order of syllables jn 
learning was different from that du” 
the initial reading. gn 

Condition III. Learning without 
prior reading of the list. e 


Waters considered the compariso® И. 
tween Conditions II and III to be ‘ofl 
crucial test for the effect of familiariZ2" of 
He found no difference in the numbé [0° 
trials to learn the lists under the tw? 
cedures. 


Sheffield, however, did not сообу, 
Waters’ conclusions. In Sheffield’s Ca 1 
study, syllables were prefamiliarized jg 
procedure which did not involve the f° jø 
tion of associations between syllables oct 
the prefamiliarization session the SU ge 


saw the word or syllable, vocalized i^ 


— "LS 


yt 
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tried to find it quickly in a list of three- 
letter words. When the word was found, 
it was marked and vocalized again. These 
items were then paired and compared with 
pairs of new items. The prefamiliarized 
pairs were learned more easily than the new 
pairs. 

A. third study on the role of familiarity 
has been reported by Hovland and Kurtz 
(1952). These authors suggest that Waters 
failed to demonstrate an effect of prior 
familiarization because his technique re- 
quired the unlearning of associations which 
were formed during learning. In their ex- 
periment, 12 subjects learned lists of 6, 12, 
and 24 nonsense syllables, both with and 
without prior familiarization of the items. 
The prefamiliarization technique consisted 
in having the subjects study the nonsense 
syllables which were presented in constantly 
changing order until they knew the syllables 
well enough to recognize them and also to 
supply the missing letters when parts of 
the syllables were presented. The effect of 
prior familiarity with the syllables was to 
reduce the number of trials to mastery by a 
moderately constant amount for all lengths 
of lists. 

While familiarity (in the sense described 
above) has been shown to play a role in 
associative learning, the evidence is not 
Sufficient to prove that the superiority of 
meaningful material is entirely due to this 
factor. 

Explanations closely allied to the famili- 
arity and association approaches have ap- 
peared in recent years. These stress the role 
of linguistic structure and patternings rather 
than the familiarity of the items or the 
mediation by specific associations: 

4. Transitional (or sequential) probabil- 
ity. Words or letters in speech and writing 
do not appear in random sequences but fol- 
low one another in predetermined orders. 
Transitional analysis investigates the de- 
pendent probabilities of sequences of lin- 
guistic units: | 

Given the unit "x," what is the prob- 
ability that it will be followed by unit ny"? 
Some sequences never appear ш English 
While others vary in frequency of occur- 


rence. The letter “q,” for example, is never 
followed by any letter other than “и”; there 
is a greater likelihood that “е,” as the first 
letter of a word, will be followed Бу “п” 
or "s" than by “у.” The word “is” or 
"says" will more likely occur after "he" 
than will "electron." Higher order prob- 
abilities can be obtained by posing the ques- 
tion: given the sequence of units XY or 
ХУ 7, what is the probability that a par- 
ticular unit will be the next to appear? 

Miller and Selfridge (1950) argue that 
sequences of high transitional probability 
will be easier to learn than those of low 
frequency. To obtain evidence for this 
hypothesis, they constructed word sequences 
with different orders of approximation to 
standard English. For example, a zero- 
order was derived by selecting words at 
random from a dictionary. For a first-order 
approximation, a word is chosen if it fre- 
quently follows one word before it; for a 
second-order, two words before it, etc. The 
Shannon technique was used to generate the 
various degrees of approximation. The sub- 
jects were given the task of learning 10-, 
20-, 30- and 50-word passages ranging in 
sequential probability from a zero-order to 
text passages. The results based on an 
immediate recall test are in accordance with 
the hypothesis that higher-order sequences 
are learned more readily than are lower 
ones. 

An important finding in this experiment 
is that above the fifth-order there is very 
little improvement in recall for all sequence 
lengths. Some passages of low order were 
learned as well as text passages. For ex- 
ample, the third-order 20-word nonsense 
sequence—‘“family was large dark animal 
came roaring down the middle of my friends 
love books passionately every kiss is fine”— 
was learned as well as a “sensible” passage 
of 20 words taken from text. 

Miller and Selfridge (1950) conclude 
from this result that it is not meaning per 
se which facilitates learning but the pres- 
ence of sequences frequently encountered 
in the past: 

Thus there are kinds of nonsense that are as 
easy to recall as are meaningful passages. The 
significant distinction is not to be drawn between 
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meaning and nonsense but between materials that 
utilize previous learning and permit positive trans- 
fer and materials that do not. If the nonsense 
preserves the short range associations of the 
English language that are so familiar to us, the 
nonsense is easy to learn (p. 183). 

Word pairs or sequences which occur to- 
gether frequently in speech and writing 
have built up strong associations and are 
therefore easier to learn when presented in 
an experiment than are nonsense syllables 
or random assortments of words. 

It is apparent that the concept of tran- 
sitional probability as applied to the prob- 
lem of associative learning is essentially 
a restatement of the first approach con- 
sidered above. 

5. Grammatical habits. Almost all of 
the verbal arrangements we experience and 
produce are ordered according to the rules 
of syntax. Nevertheless, a sentence, for 
example, cannot be defined simply as a 
highly probable sequence of words. Lin- 
guists distinguish between syntactical order 
and dependent probability computations. 

“One’s ability to produce and recognize 
grammatical utterances is not based on 
Notions of statistical approximation and the 
like” (Chomsky, 1957, p. 16). Again, “If 
we rank the sequences of a given length in 
order of statistical approximation to Eng- 
lish, we will find both grammatical and 
ungrammatical sequences scattered through- 
out the list; there appears to be no 
Particular relation between order of ap- 
proximation and grammaticalness” (Chom- 
sky, 1957, p. 17). Grammatical structure 
can be easily recognized even in an arrange- 
ment of nonsense syllables, e.g., “The vapy 
koobs desaked the citar molently” (Fries, 
1952, p. 111). 

It can be assumed that the great fre- 
quency of grammatical sequences in our 
experience has given rise to verbal habits 
or learning sets. These habits would facili- 
tate the learning of new material arranged 
in accordance with grammatical structure; 
in this case, there will be positive transfer 
from previous learning of material with the 
same structure. Ungrammatical or random 
sequences of words would be more difficult 
to learn. 


Some evidence for this view is cited by 
Osgood (1957) : 

. one of my students, Mr. Albert Swanson, 1 
compared the ease of learning nonsense sequences | 
that retained the structure of the English sentence $ 
{тот which they were derived, for example, 

The тай vlems oothly um the glox nerfs, , 
with matched materials in which the grammat! 
cues had been eliminated, for example, 

тай vlem ooth um glox nerf. 
The structured series were learned more easily tha” 
random lists (p. 88). 

We have reviewed several explanation 
for the superiority of meaningful mater! 
in learning. Four of these account for the 
effect in terms of transfer from previous 
learning of one kind or another—of spec! 
items, associations, verbal sequences, 1 
grammatical habits. Köhler's explanation 
the only one based on the referential asp. 
of language. As we have seen, Шеген 
some evidence that familiarity factors ? 
important. But this kind of evidence 90 
not necessarily rule out the role of meanifi? 

In the experiments to be reported in th® 
paper we attempt to distinguish mean” 
from familiarity and to investigate 
separate contributions of these factors ' 
paired-associate learning. id 

We know of only one study—by Sheff 
(1946)—which deals with this problem. А 
stated above, Sheffield found that famil, 0 
ized items are easier to associate than ^; 
novel ones. He also raises the questio? jl 
to whether meaning makes an additi? f 
contribution in the learning process. ro 
field defines meaning in S-R terms 25 * 3 
perceptual response which is aroused P’ se 
stimulus pattern; this perceptual resp? фе 
is implicit and has been connected tO yo 
stimulus pattern by past learning. Acc ў 
ing to this definition, all familiar sti d 
have meaning based on past experience nf 
even novel stimuli may have some mer oid 1 
based on stimulus generalization. She, by 
attempted to make an item meaning! gg | 
attaching a distinctive perceptual resP ). 
to a word or syllable ("symbolic traint”? X, 
For example, when the subject saw ite й) 
he was to circle it with a green e 
item Y was to be blacked out with а P yas 
etc. In familiarity training the respo? 
the same to all stimuli used. 


cal 


| 


CA 
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The paired-associate learning task was 
designed so that prefamiliarized material 
and material which had been made “mean- 
ingful" by symbolic training could be com- 
pared with novel items. The results show 
that familiarity and meaning were equally 
effective. "Symbolic training had no effect 
other than the correlated familiarization" 
( Sheffield, 1946, p. 194). 

More recently, other stimulus-response 
definitions of meaning (Osgood, 1953; 
Noble, 1952) have been proposed. In the 
present study we are not concerned with 
the problem of the functional nature of 
meaning—whether it consists of autonomic 
responses or cortical events. For our pur- 
poses, the dictionary definition of meaning, 
as that which is named or referred to by a 
word, seems adequate. The sound CAT is 
meaningful when used by the speaker and 
comprehended by the listener to refer to a 
particular object or class of objects. Words 
may also refer to events, activities, abstract 
concepts, etc. 

Of course, a word must be familiar in 
order to be meaningful; a novel word (in 
isolation) could have no meaning. Never- 
theless, familiarity in itself may not give 
rise to meaning. The syllable "GYX" may 
become quite well-known with repetition, 
yet when heard or read it does not arouse 
any experience beyond itself—no one 
concept, or activity is referred to or named. 
It is therefore meaningless. - ; 

Some classes of familiar linguistic units 
possess structural meanings (Fries, 1952), 
ie. they are meaningful only as parts of a 
sentence. In isolation such words as. who, 
“the,” “for,” etc., do not refer to or identify 
any kind of experience. Words like ' cat, 
“table,” "run," "talk," “war,” "pretty, оп 
the other hand, do not have to be given 
in a larger verbal context to have meaning. 
Presented by themselves, these words will 
call up specific mental content. One has no 
difficulty in defining any one of them; we 
however, can be defined only as a member 
ofa particular class of words, or as having 
a particular grammatical function. и 

We shall, therefore, use the term "mean- 
ingful" to refer to types of words such as 
nouns, verbs, and adjectives, for which 


lexical meanings can be readily elicited 
when they are presented by themselves. 
The term "familiar" will be applied to non- 
sense syllables which have been experi- 
enced previously and also to types of 
words such as prepositions and conjunc- 
tions. (It is perhaps preferable to say that 
these latter words have meaning only in 
the limited sense described above than to 
call them entirely meaningless.) 

Тре hypotheses which guided the experi- 
mentation to be reported were: 

l. Familiarity by itself will facilitate as- 
sociative learning. 

2. Meaning will enhance learning even 
more because it makes possible organization 
or conceptual unit formation as described 
by Kohler. 


PLAN or RESEARCH 


To obtain evidence for these hypotheses 
the following studies were carried out: 

1. The effect of familiarity. For this pur- 
pose, prefamiliarized nonsense syllables 
were compared with novel ones (Experi- 
ment I). 

2. The effect of meaningfulness. Mean- 
ingful words (nouns) were compared with 
equally familiar words of limited meaning 
(conjunctions and prepositions) and non- 
sense syllables (Experiments IT and III). 

3. The explanation of the superiority of 
conjunctions and prepositions over nonsense 
syllables found in Experiments II and ПТ. 
To see whether this result is due to 
familiarity or to the limited meaning of 
the former items, conjunctions were com- 
pared with familiar nonsense items (Ex- 
periment IV). 

4. The role of transitional probability. 
To see whether the superiority of meaning- 
ful words over merely familiar ones (Ex- 
periment II and III) can be attributed to 
this factor, we compared noun pairs with 
pairs of words which possessed high tran- 
sitional probability (Experiment V). 

5. The role of organization or conceptual 
unit formation. Pairs of pictures represent- 
ing specific objects were compared with 
concrete noun, abstract noun, and verb pairs 
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(Experiment VI). Further, pairs of nouns 
joined by a "sensible" connecting word 
were compared with pairs having an irrele- 
vant connective (Experiment УПа) and, 
finally, pairs of pictures of objects which 
were arranged to form a unit were com- 
pared with picture pairs in which the items 
remained separated as in the traditional 


paired-associates method (Experiment 
VIIb). 


Experiment I 


, In the first experiment we were interested 
in demonstrating the facilitative effect of 
familiarity with material made familiar 
during the experimental period. 


Method and Materials 


А ‘The general procedure consisted in prefamiliar- 
izing nonsense syllables which were later formed 
into pairs to be learned as paired associates. 
Learning of the prefamiliarized nonsense syllable 
list of six pairs was compared with the learning 
by the same subjects (Ss) of a list of six pairs of 
new nonsense pairs under identical learning condi- 
tions. All of the nonsense syllables were chosen 
from Glaze's (1928) list of syllables at the 80% 
level of associative value, 


The prefamiliarization technique employed in 
this experiment was designed to overcome certain 
po oo present in methods previously used by 
15 er investigators. The Ss were required to learn 
2 short lists, each consisting of four numbers and 
опе nonsense syllable, Each item in the list was 
printed individually on a blank card. Ss read the 
ium aloud and received two trials with every 
ist; а test of total recall followed each trial. In 
almost all cases the S was able to recall the 
syllable perfectly after the first trial, None of the 


Ss Suspected the purpose of this procedure. The 
instructions were as follows: 


— ап experiment to determine how well 
ies ү carn certain material. I have made up 12 
one ists each consisting of 5 items, 4 numbers, 
I Py сани Syllable (explained if necessary). 
1 = m to try to learn these lists one by one 
in the following way... . At no time will I 
ask m to recall a list which you have already 
passed and been tested for, so don't try to keep 
them all in mind or your task will become 
impossible. 

| This procedure differs from previous techniques 
in two respects. First, the method ensures familiar- 
ization by inducing an intention to learn as con- 
trasted with a purposeless repetition of the material 
(cf. Waters). Moreover, the ma! 


; ; terial is not рге- 
sented in serial order. The technique of я 


reduces to near zero the possibility that any ob- 


tained effect of prior familiarization can be 
attributed to the existence of remote associations 
between the syllables rather than to familiarity as 
such. 

The 12 syllables thus familiarized were then 
paired, thereby constituting one learning list of six 
pairs. Every effort was made to reduce to а 
minimum any factors which might cause one pair 
to be easier than any other, e.g., both members of 
a pair having identical initial consonants, etc. 
he pairs were printed on blank 3 X 5 cards in 
capital letters. The stimulus item was printed on 
the bottom left-hand corner of the other side of 
the card. During learning, each pair was presented 
for 5 seconds (as measured by metronome) with a 
2-second interval between each pair. During the 
test trials the stimulus item was presented alone 
for a 5-second interval with a 2-second interval 
between presentations. 


Subjects, Forty-eight college students partici- 
pated in this experiment. Thirty-two 55 served in 
the main experiment and 16 Ss took part in a 
supplementary experiment. 

Experiment I. The Ss in the main ex- 
periment were divided into two groups. 
The 16 Ss in Group 1 were treated in the 
following way: (а) Prefamiliarization ses- 
sion as described above. (b) Two learning 
trials with the list of now familiar items. 
(c) Test after an interval of 1 min. follow- 
ing the second learning trial. (d) Two 
learning trials with new pairs of syllables. 
(e) Test after a 1 min. interval following 
the second trial. For 8 5$ of Group 1 the 
prefamiliarized syllables consisted of those 
syllables which had been new for the other 
8 Ss. Thus, each item served both as fa- 
miliar and as new for different Ss. In Table 
1 we refer to this fact by recording the data 
under two headings, List A and List P- 
Thus syllable DEV, which was new in List 
A, was familiar in List B. The results fot 
Group 1 are found in Table 1. The meat 
number of prefamiliarized pairs learned was 
3.1, and the mean for the new pairs, 2.1. 

The same procedure was employed for 
the 16 Ss in Group 2 except that Ss in this 
group always learned and were tested ОЛ 
the new material first and then proceede 
to prefamiliarization, followed by learnins 
of the familiar list and test. The results f°" 
this group are also presented in Table 
The mean number of new pairs learned was 
2.6, and 3.8 for the familiar pairs. Р 

This experiment was then repeated for 
the following reason: the new pairs ip 
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List B of the main experiment fared pretty 
well (particularly in Group 1 and to a much 
lesser degree in Group 2), with the result 
that the differences between familiar and 
new in List B are not as impressive as the 
differences obtained with List A. For this 
reason, the 12 syllables making up these 
Six new pairs were interchanged to form 
different pairs which then constituted the 
new (nonfamiliar) list for the 16 5$ par- 
ticipating in this experiment. The syllables 
to be prefamiliarized were the same sylla- 
bles which were familiarized in List B of 
the main experiment. Group 3 consisted of 
eight Ss who learned the nonfamiliar list 
first and then the familiar pairs. These 
Ss retained a mean number of 2.9 new 
pairs and 3.9 familiar pairs. Group 4 con- 
sisted of eight new Ss who learned the 


familiar list first. The mean number learned 
was 2.5 for the familiar and 1.4 for the 
nonfamiliar. Consideration of the frequen- 
cies in Table 1 shows that reordering the 
pairs had the effect of bringing out the 
difference between the familiar and non- 
familiar syllables. This confirms our con- 
jecture that the factor responsible for the 
lack of significant difference for this mate- 
rial as given in the original List B is not 
the ineffectiveness of familiarization but 
rather the very high level of learning for 
the new pairs by the Ss in this group. 
Inspection of the last column of Table 1 
reveals that the difference between the 
means for the prefamiliarized and new- 
syllable pairs are almost identical in the 
various experimental conditions (1.0, 1.2, 
1.0, 1.1), and all are in the expected direc- 


TABLE 1 


NuMBER OF PREFAMILIARIZED AND NEW SyLLABLE-Pairs LEARNED IN EXPERIMENT І 


Experiment I 


Group 1 
List A List B Lists A & B Comb. Mean 
ili 50 3.1 
Кашар z A a i 
ew 1 s 
N 8 
Group 2 
List A List B Lists A & В Comb. Mean 
New 19 23 42 2.6 
Familiar 31 30 61 3.8 
N 8 8 16 
Group 3 Group 4 
Order 
"Hem Mean Familiar-New Mean 
23 F n 1.4 
-9 
redis 31 : 8 "n 
N 8 
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TABLE 2 


COMBINED RESULTS FOR EXPERIMENT I 


Number Mean Number 
Material of Pairs of Pairs SD 
Learned Learned 
New 110 2.4 1.36 
Familiar 162 3.4 1.35 
N 48 


Note.—za» — žy is significant at the .01 level; { = 5.3. 


tion. In Table 2 we have combined the 
data for all conditions. When this is done, 
the mean number of nonfamiliar pairs 
learned is 2.4 and the mean number of 
familiar pairs is 3.4 for 48 Ss. The standard 
deviation is 1.36 and 1.35 for nonfamiliar 
and familiar pairs, in that order. The 
difference between the means in favor of 
familiar pairs is significant at better than 
the .01 level (f = 5.3; df — 47). 


Experiment II 


In this experiment, we used items which 
are equally familiar but which differ in 
meaningfulness—we compared nouns with 
conjunctions and prepositions. If the two 
types of words were learned equally well, 
the meaning hypothesis would be invali- 
dated; if, however, an advantage were to 
be shown by the nouns, then such results 
would be attributable to the factor of mean- 
ing. Nonsense syllables were also used, to 
be compared with the conjunctions. 


Materials and Procedure 


Two series composed of three different types of 
material were used. Each series consisted of three 
lists of six pairs of items: a list of conjunction 
and preposition pairs composed of three conjunc- 
tion pairs and three preposition pairs, a list of six 
noun pairs, and a list of six nonsense syllable pairs. 
Two series were used in order to allow for a 
larger sampling of verbal material and thus reduce 
the possibility that our results might be attributable 
to the particular words chosen. An attempt was 
made to use noun pairs Consisting of members not 
ordinarily paired in everyday use. A check of the 
Thorndike-Lorge (1944) word count showed that 
in general, the nouns, conjunctions, and prepositi ons 
we used occur with equal frequency. The nonsense 


syllables were chosen from Glaze’s list of 80% 
syllables. All of the lists are reproduced in Table 4. 

Each pair was printed on a blank white index 
card. The individual left-hand test items were 
printed on the back of these cards. Each list wet 
presented to the 5 for one trial, and each раї 
this list was presented for 5 seconds as indicat 
by a metronome. The Ss were asked to read 
pairs aloud (to pronounce, not to spell, 
syllables) and to try to associate the items 0 
cach pair with one another in preparation forts 
test of their memory which would follow so 
after learning. The Ss were informed that Е 
sequence was not to be learned since the order 
the pairs would be randomized for the test presen 
tion. After one learning trial, the Ss were Se 
to work on a paper and pencil maze. This 1 = 
polated activity lasted for 1 min. The SUN d 
then tested by being asked to recall the rig 
hand member of the pair upon presentation 0’, 
left-hand item. This procedure was follow ve 
all three lists in each series. The order of aed 
sentation during the test was always rando def 
by shuffling the cards to a new order. The : 
in which the different lists (materials) was that 
sented was systematically prerandomized i pt 
the different types of materials followed an Also 
ceded each other an equal number of times: ^ jed 
the order of pairs within each list was cham a 
for each S. Each test item was presented fo 
4-second interval. LB 

Subjects. Thirty-six college students particip? 
in this experiment. One group of 18 Ss lea 
Series I; a second Group of 18 learned Series 
The Ss were assigned in alternating order t? 
or the other Series. 


| 
g 
d 
ed 
И. 
one 
Results 
The results for this experiment are ИК 
sented in Tables 3 and 4. For Series 1 18, 
mean number of pairs learned was 2^ ~ 
1 


ТАВІЕ 3 


T 
NUMBER OF PAIRS LEARNED IN ExPERIME" 


Com- 50 

Series|Series|Series ws і 

п 1&ИМеа Ч 
Ме 


Nouns 
Conjunctions & 

Prepositions 
Nonsense Syllables 
N 


49 | 55 
31 
16 
18 


ol 
Note.—The difference between the mean number j Рейс 
pairs retained and the mean number of conjunctions 203640 d 
sitions was significant at the .01 level ( = 3.9). The Cosine 
between the mean number of conjunctions and E ret 
retained and the mean number of nonsense syllable 

was significant at .001 level (¢ = 6.7). 


| 


+ 


e 


| 
| 


k 
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TABLE 4 
ITEM ANALYSIS OF RESULTS IN EXPERIMENTS 
II AND Ш 
Number of Ss Who Responded Correctly 
to Each Pair 

Experi- Experi- 

Material ments? | Material ments? 
Series I II III | Series II TL ш 
car—leaf 9 11 | boat—wall 10 13 
sink—park 9 11 | match—rent 9 15 
fish—rug 8 12 | mat—card & 12 
аїг—сһїп 10 13 |slip—dog 8 13 
wood—shirt | 7 14 | suit—key 10 12 
knee—mask 7 12 | pen—sand 10 17 
hence—or 6 10 | still—if 5 9 
also—but 8 9 |now—and 6 10 
then—yet 6 9 | how—nor 4 12 
at—írom 5 10 |in—by 6 10 
off—out 5 10 | to—for 6 10 
down—of 5 8 | with—up 4 11 
vax—neb 1 6 | gos-bif 1 9 
muc—yeg 2 9 | fij—koz 3 6 
het—tas 2 5 |jor—gan 4 8 
fer—wid 4 8 |ril—dut & d 
mot—daw 3 7 | pir—yan 2 $ 
lav—fow 0 5 | dev—ruc 3 1 


^ 18 Ss served in each condition. 


and .7 for the nouns, conjunctions and 
prepositions, and nonsense syllables, in that 
order; for Series П the mean number of 
pairs learned was 3.1, 17, and 9, When 
the data for Series I and II are combined, 
the mean number of pairs of nouns, con- 
junctions and prepositions, and nonsense 
syllables was 2.9, 1.8, and .8, respectively. 
The standard deviations were 1.79, 97, and 
.87, also in the same order. 

An item analysis of these resuits is pre- 
sented in Table 4. АП the differences were 
highly significant (using a one-tailed test 
for paired values, nouns vs. conjunction- 
= 3.9, significant at .01 level 
with 35 df; conjunction-prepositions vs. 
nonsense syllables, # = 6.7, significant at 
001 level with 35 df). An analysis of in- 
dividual 5 performance revealed that 24 
of our total group of 36 learned more 
noun pairs than conjunction-preposition 


pairs, 5 Ss an equal number of both kinds 


prepositons, t 


of material, and 7 Ss more conjunction and 
preposition pairs. Twenty-six Ss learned 
a greater number of conjunction and prepo- 
sition pairs than nonsense pairs, 6 Ss 
showed no difference, and 4 did better with 
nonsense material. 

The results of Experiment II provide 
evidence for the facilitating role of mean- 
ing per se; they therefore contradict Shef- 
field's finding that material which is both 
meaningful and familiar is no easier to 
learn than merely familiar material. This 
contradiction is perhaps attributable to 
Sheffield's ‘conception of meaning which 
obscures the distinction between a familiar 
item and one which is meaningful as well. 

The significance of the superiority of con- 
junction and preposition pairs over non- 
sense pairs will be discussed below (Ex- 
periment IV). 


Experiment Ш 


Before we can conclude that the results 
of Experiment II demonstrate the role of 
meaning (and familiarity) in the formation 
of associations, another possibility must be 
considered. 

Suppose a S has associated A with B 
so that when A is presented he is able to 
recall B. Kohler (1940) has pointed out 
that, on the functional level, the reproduc- 
tion of B must involve two steps. The A 
item must first be recognized, i.e., the 
memory trace of A is activated because of 
its similarity to the present percept of A. 
The second step is the arousal of trace B 
(recall by association). Suppose now that 
the S cannot recall B. If there is no diffi- 
culty in the recognition of A, how is this 
failure to produce B to be explained? One 
possibility is that no bond was formed be- 
tween А and B in the original learning 
situation. Another is that the bond was 
made but for one reason or another the 
trace of B is not available. For example, if 
B is a difficult, unfamiliar nonsense item 
the trace of B may not be readily accessible 
even though the bond between A and В is 
well established. The existence of the 
association could, however, be revealed by 
the use of a matching test; here, the arousal 


10 WILLIAM EPSTEIN, IRVIN ROCK, anp CARL В. ZUCKERMAN 


of B is facilitated by a recognition process. 
Hence, failure to recall B is not evidence 
that the association has not been established. 


Thus, the superiority of the nouns over 
conjunctions and prepositions (and of the 
latter over nonsense syllables) might be 
explained by reference to the problem of 
trace availability. Perhaps all associations 
were equally well formed in the learning 
process; however, if traces of nouns (in 
the В position) are more accessible than 
those of conjunctions, recall scores for the 
nouns will be higher? If this were the case, 
then the differences obtained in Experiment 
II should disappear when learning is tested 
by the matching procedure. This would be 
expected since the accessibility of the asso- 
ciated trace is not critical to performance 
оп а matching test. If, however, differences 
comparable to those obtained with a repro- 
duction test are present with matching also, 


then we may assume an effect on the forma- 
tion of associations, 


Materials and Procedure 


Experiment III Was a repetition of the previous 
one except that Ss were tested by a matching 
Procedure rather than by recall. The same material 
Was used and the learning conditions were identical. 
In the matching test, the A item was presented and 
ће 5 Was instructed to select the correct В item 
from a list Containing all of the B items arranged 
In a circular pattern, 

Subjects, Thirty-six college students participated, 


Ss learned Series [ and the other 18 learned 
eries II, 


Results 


Experiment III are pre- 


nd other details are given 


, er (Series I and IT 
combined). The standard deviations were 


: 1 ; т the same order. The 
following differences between the means 
were significant. Nouns vs. conjunctions 


° Sheffield’s (1946) finding that recall is bett 
when a meaningful item is т the response (1 а. B) 
position than when it is in the stimulus Ge, A) 
position may be explained in this way. i 


and prepositions, # = 4.1, at the .01 level; 
conjunctions and prepositions vs. nonsense 
syllables, ¢ = 3.0, at the .01 level. 

We may note, then, that although the 
absolute scores for all materials were Sig- 
nificantly higher than in Experiment II, the 
differences between the materials were 
maintained on an almost identical level. This 
near identity of the differences between the 
means of the various materials for the WO 
experiments suggests that it is not neces 
sary to assume that there is any additiona 
facilitation of meaningfulness and familiar- 
ity in the recall process. The influence 9 
these factors seems to reside entirely in the 
process of forming associations between 
the two members of a pair. 


Experiment IV 


In this experiment we wanted to E: 
whether the superiority of the conjunctis 
preposition pairs over the nonsense pur 
( Experiments II and III) was due entire? 
to their familiarity rather than to the limite. 
meaningfulness they might possess. For e 
purpose, we compared them with familiaf 
ized nonsense syllables. 


Method and Materials 


The materials consisted of two lists of 6 pait 
each. One list was made up of 3 pairs of zo 
junctions and 3 pairs of prepositions chosen the 
the lists in Series I and II of Experiment II. Jes 
other list consisted of 6 pairs of nonsense syllabi 


also chosen from the lists in Series I and 


TABLE 5 


ш 
NUMBER ОЕ PAIRS LEARNED IN EXPERIMENT 


Series 


а en 77777 Comb‏ ا 
m JI&I|Mean| $77‏ | 1 
1 
Nouns 73 | 82 |155 | 4.3 144‏ 
Conj. & Prep. | 56 62 | 118 3.3 | logy‏ 
Non Syl. 40 45 85 2.4 | 1.‏ 
N 18 | 18 | 36‏ 


со! 
. Note.—The difference between the noun pairs and the evel 
Junction-preposition pairs was significant at the "ble pane 
U = 4.1). The difference between the nonsense syllable. t? 
and the conjunction-preposition pairs was also signi 
the .01 level (¢ = 3.0). 


оаа. н 
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TABLE 6 


Pairs USED IN EXPERIMENT IV 


Nonsense Syllables | Conjunctions & Prepositions 


daw—lav off—out 
yon—cav at—írom 
jor—mot down—of 
vax—fer how—nor 
fij—gos with—up 
ril—gan now—and 


Experiment II. Table 6 contains the materials 
used in this experiment. 

Subjects, Thirty-six college students participated 
as a group in this experiment. 

Procedure. The prefamiliarization procedure was 
essentially the same as that described in Experi- 
ment I with some slight modifications to adjust for 
a group design. There were two familiarization 
sessions separated by 48 hrs. At the end of the 
first session, the Ss were led to believe that their 
participation in the experiment was completed. 
Most expressed surprise when the Е appeared the 
second day. After the second prefamiliarization 
session, one half of the group was asked to leave 
the room. The remaining group then received one 
learning trial with the prefamiliarized nonsense 
syllable now presented in 6 pairs, followed by an 
interpolated interval of 1 min. filled with work 
on matchstick problems. Ss were then tested by 
being required to reproduce in writing the B item 
upon being presented with the A member of the 
pair. When this was completed, the conjunction 
and preposition pairs were learned and tested for 
in the same way. Finally, the second group re- 
entered the room and followed the same procedure 
except that this group learned the conjunctions 
and prepositions first. 


Results 


The results for this experiment appear in 
Table 7. When the data for both groups 
are combined, the mean number of pairs 
retained for the two different materials is 
identical (2.6). The standard deviations 
were 1.14 and 1.51 for the prefamiliarized 
syllables and conjunctions-prepositions, re- 
spectively. А further look at the individual 
scores reveals that 17 of the 36 participating 
Ss learned a greater number of syllables 
than conjunctions and prepositions, AS 
learned a greater number of gy s 
and preposition pairs, and 4 of the 36 Ss, 


an equal number of both materials. 


Since we have found no difference be- 
tween familiarized syllables and conjunc- 
tions we may assume that the superiority 
of the latter material in Experiments II and 
III is attributable solely to its familiarity. 
The further advantage shown by the nouns 
in those experiments, therefore, confirms 
the hypothesis that meaning acts as a 
separate factor in learning. 


Experiment V 


The experiments reported so far support 
the view that the advantage of meaningful- 
ness in learning cannot be attributed to 
familiarity. The nouns and conjunctions we 
used occur with approximately equal #ге- 
quency in the language. But there is an- 
other kind of familiarity to be considered— 
that of certain sequences of words. Al- 
though the particular pairs of nouns 
employed in our experiments may not have 
been frequently experienced together, it 
might be argued that they have occurred 
more often than the pairs of conjunctions. 
In everyday speech and writing we often 
list series of nouns so that there is some 
degree of probability that we have already 
experienced one or more of the noun pairs 
used in Experiment II. It is much less 
likely that any of the conjunction or prepo- 
sition pairs have occurred previously. Con- 
sequently, on the basis of the Miller and 


TABLE 7 


NuMBER OF CONJUNCTION, PREPOSITION, AND 
PREFAMILIARIZED SYLLABLE-PAIRS LEARNED 
IN EXPERIMENT IV 


No. | 
Retained No. Retained 
Order | Order 
Material SyL- | Conj. & | Comb. | СотЪ. 
Conj. &| Prep.- | Mean SD 
Prep. Syl. 
Conj. & Prep. 50 42 2.6 1.51 
Prefam. Syl. 45 48 2.46 1.14 
N 18 18 


Note.—No difference between the means. 
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Selfridge results, one could predict the 
superiority of the noun pairs. 

(Another possible explanation of the re- 
sults of Experiment II is provided by the 
grammatical habit hypothesis. This view 
Stresses conformity with grammatical use 
rather than the statistical frequency of 
specific word sequences. ) 

We do not wish to deny the role of the 
transitional probability factor in learning ; 
we wish to suggest, however, that it is not 
sufficient to account for our results. In 
order to test this view, we need to compare 
our noun pairs with pairs of words which 
occur together much more frequently but 
which cannot be readily organized into con- 
ceptual units. For example, the words 
"here—is" have a high sequential probabil- 
ity but neither word has the meaningfulness 
of a noun. If the noun pairs were to do 
better than material of this kind, then the 
transitional probability approach could not 
explain the results of our previous experi- 
ments. 

We decided also to compare pairs such 
as "here—is" with our conjunction-preposi- 
tion pairs. The effectiveness of sequential 
Probability would be demonstrated if the 
Dew material were learned better. 

ince there has been no systematic com- 
putation of the transitional probability of 
word sequences we chose material in the 
following Way: we scanned several books 
and selected pairs of contiguous words 
Which occurred together quite frequently. 


Material and Procedure 


will agree that the i 
ability. (See Table 8) Ne 2 
cannot be readily Organiz 
List B consisted of th 
pairs and List C the no 
in Experiment IL Th 
was identical with that used in E 


The order in which the th xperiment П. 


ree lists were presented 


* We wish to thank Susan M, Ervin for с: 


our attention to this possibili: P 
munication. 1958, М кош 


alling 
com- 


was systematically randomized. The order within 
each list was also randomized from the learning 
trial to the test and between Ss. 


Subjects. There were 36 college students in the 
experiment. One half of these learned Series 
and the other half Series II. 


Results " 
The results are shown in Tables 8 and 9. ' 
For Series I the mean number of pais | 
learned was 2.3, 2.1, and 3.4 for Lists А, Д 
and С, respectively. The standard deviation 
were .79, .89, and 1.12. The difference 2 
tween the mean obtained with List C and 


TABLE 8 | 
ү 
ITEM ANALYSIS oF RESULTS BY EXPERIMENT 
The Number of Ss Who Responded Correctly 


to Each Pair 
Series I Series II 
Pair N Pair N 

List A List A 
are—not 9 before—any 1 
such—an 4 here—is г. 
will—have 9 and—does PR | 
ат— тоге 5 was—under 1 | 
which—makes — 5 then—it " 
that—this 9 follows—the — ! 

List B List B 
down—of 7 now—and 
then—yet 5 still—if 
off—out 4 how—nor 
at—from 6 with—up 
also—but 9 to—for 
hence—or 7 in—by 

List C List C 
knee—mask 10 match—rent 
air—chin 10 mat—card 
fish—rug 10 suit—key 
sink—park 9 slip—dog 
wood—shirt 10 boat—wall 15 
car—leaf 12 ABE 
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TABLE 9 


MEAN NUMBER OF PAIRS LEARNED IN 
EXPERIMENT V 


Series I Series II Combined 
Mean SD |Mean SD |Mean SD 
List A 2.3 -79 | 2.9 1.09 | 2.6 .96 
List B 2. .89 | 2.0 .91 | 2.05 .88 
List C 3.4 1.12|4.1 .95 | 3.75 .83 


Моше. — АП of the relevant differences are significant at the 
.01 level with the exception of ža — Zs, Series І, which is not 
significant, 


mean obtained for List A and List B is 
significant at the .01 level. The difference 
between the means for List A and B is 
not significant. . 

For Series II the mean number of pairs 
learned was 2.9, 2.0, and 4.1 for Lists A, 
B, and C, respectively. The standard devia- 
tions were 1.09, 91, and .95. The mean 
for List C was significantly better than the 
mean for List A, and this in turn was sig- 
nificantly better than the mean for List B. 

The combined means for both series were 
2.6, 2.05, and 3.75 for Lists A, B, and С 
The standard deviations for the combined 
means were .96, .88, and .83. АП of the 
differences were significant at better than 
the .01 level. 

The results give evidence for the role of 
sequential probability—as shown by the 
superiority of List A over List B (com- 
bined means of Series I and II). But the 
even greater superiority of the nouns clearly 
supports the view that the role of meaning 
cannot be reduced to familiarity of the 
transitional probability kind. 


Experiment VI 


We had obtained evidence that meaning 
plays à role in associative learning above 
and beyond the facilitation due to familiar- 
ity and transitional probability. We then 
attempted to develop further data in sup- 
. port of Köhler’s organization hypothesis. 

We chose materials which varied in the 
extent to which they permitted the poe 
tion of conceptual units. „Сопвтега шов S 

feri ific object or thing, such 
referring to а specific 00) 


as "suit," "chair," "dog," etc.—should be 
easy to build into a unified concept. Ab- 
stract nouns—referring to emotions, ideas, 
etc., such as "grief," "time," “war,” etc.— 
on the other hand, should be more difficult 
to organize. One can, for example, unify 
the pair “cat—leaf” by thinking of a cat 
playing with or eating a leaf. It would be 
more difficult to do something similar with 
the abstract word pair “law—time.” Our 
hypothesis predicts, therefore, that concrete 
noun pairs should be learned more easily 
than abstract ones. Drawings of specific 
objects were also employed. We felt that 
visual representations would be unified 
easily since the S would actually perceive 
the two objects together. We expected the 
drawings to do even better than the con- 
crete nouns. We selected verbs as the final 
class of material. In this case our expecta- 
tion was that verb pairs would probably not 
be as easy to organize as concrete nouns. 
Our hypotheses did not permit a prediction 
as to the relative difficulty of learning verbs 
and abstract nouns. 


Materials and Procedure 


Two series of four lists were constructed. Each 
series was composed of one list of six pairs of 
pictures of objects, one list of concrete nouns, one 
of abstract nouns, and one of verbs. The material 
is presented in Table 10 and Figures 1 and 2. 

Each S learned all four lists. Each list was pre- 
sented for one trial at the rate of one pair every 
5 seconds. The Ss read the pairs or gave the 
name of the depicted objects aloud under instruc- 
tions to learn the list in preparation for a test 


qd а 
d we 
Эх. RUNE 


Fic. 1. Pairs of pictures of objects used in 
Experiment VI, Series A. 
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Fic. 2, Pairs of pictures used in Experiment VI, 
Series B. 


in which they would be asked to supply the missing 
right-hand item upon presentation of the leít- 
hand item. When the learning trial was completed, 
E took 30 seconds to randomize the order of the 
pairs and to remind S very briefly of his task. 
The test was then given. This procedure was 
followed for all four lists individually. The order 
in which the lists were presented was systematically 
prerandomized so that every list followed and pre- 
ceded every other list an equal number of times. 


The order of pairs within the list was randomized 
from S to S. 


The purpose of the two series (A and B) was 
to control for possible chance bias in the material. 
The pictures of Series A were presented as con- 
crete nouns in Series B, and the concrete nouns 
in Series A were presented as pictures in Series В. 
In addition, Series B contained completely new 
lists of abstract nouns and verbs. 


Subjects. Forty-eight college students served as 
Ss. Twenty-four Ss learned the material in Series 
А and 24 Ss learned Series В. 


Results 


The results are presented in Tables 10 
and 11. The mean number learned for 
Series A was 3.9, 3.1, 2.3, and 1.9 for the 
picture, concrete noun, verb, and abstract 
noun lists, in that order. The standard de- 
viations were 1.34, 1.70, 1.67, and 1.30, in 
the same order for the four materials. 

The mean number learned for Series В 
was 3.8, 3.0, 2.3, and 22 for the picture, 
concrete noun, verb, and abstract noun lists, 
in that order. The standard deviations were 
1.18, 1.33, 1.52, and 1.34, in the same order. 

A. comparison of the data of Series A 
with that of Series B as it relates to pictures 
and concrete nouns will show that the effects 
obtained are not produced by any chance 
bias of material. 
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When the data for Series А and В are 
combined, the means were 3.9, 3.0, 2.3, and 
2.0 for the pictures, concrete nouns, verbs, 
and abstract nouns, in that order (N = 
All of the differences were significant €X- 
cept for the obtained difference between the 
abstract nouns and the verbs. See Table 
for analysis of variance and q test. The 
results of this experiment confirm our E 
pectations. Performance was enhanced as | 
direct function of the possibility of formin 
conceptual units. 


TABLE 10 


ZXPE ENT 
ITEM ANALYSIS or RESULTS FOR EXPERIMEN 


The Number of Ss Who Responded Correctly 


to Each Pair 


Series A Series B 
Concrete Nouns Pictures 
suit—key 13 suit—key is 
fish—rug 14 fish—rug i4 
knee—mask 10 knee—mask 13 
wood—shirt 10 wood—shirt $ 
slip—dog 11 slip—dog b 
boat—wall 17 boat—wall 1 

Pictures Concrete Nouns 
house—tree 19 house—tree 5 
boot—glass 12 boot—glass 11 
pencil—door 13 pencil—door 12 
hand—chair 16 hand—chair 12 
pipe—clock 17 pipe—clock 13 
book—dog 17 book—dog 


Abstract Nouns Abstract Nouns 


fame—height 9 death—mood : 
law—time 8 mind—greed 8 
truth—grief 7 luck—norm 8 
peace—fate 5 pride—age 9 
vice—life 9 weight—ideal 9 
joy—oath 8 glee—worth 

Verbs Verbs 


9 
bought—hopped 9 


sang—walked 10 
planned—shook 0 


wrote—threw 9 


brings—melts 8 slept—rang 9 
found—jump 9 flows—thinks 9 
looks—takes 9 pour—join 10 


dives—helps 10 opens—grows 
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One might have thought that the S would 
be able to unify the verb pairs by imagining 
an individual performing two actions at the 
same time—e.g., singing while walking. If 
this did occur it apparently was not as 
successful in establishing connections as 
was the conceptualization process involving 
nouns. 


Experiment VII 


In this experiment we put the organiza- 
tion thesis to a more direct test. If the 
hypothesis is correct, then material learned 
under conditions which encourage the 
formation of a unit should fare better than 
material presented under conditions which 
do not in any special way facilitate the 
occurrence of unit formation. 


Experiment VIIa 


Method and materials, Two lists of 6 pairs of 
concrete noun pairs were constructed. In one list 
these nouns were joined (or separated) by a third 
word which constituted an intelligible connective. 
Thus, we formed a list of three word phrases each 
of which comprised a unit, eg, bank on roof, 
lamp in bottle. A second list consisted of the same 
noun pairs but now joined by an irrelevant con- 
nective, that is, one which did not serve the pur- 
pose of unit formation, eg., bank late roof, lamp 
how bottle. In each list three different connectives 
were used, each serving twice. In the first list the 
words, on, in, near, were used. In the second list 
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the words how, but, late, were used. (See 
Table 12.) 

The instructions were as follows : 

This is an experiment to find out how well 
people learn certain materials. I am going to 
show you a series of six 3-word phrases. In each 
phrase the first and last words have been under- 
lined. I want you to read aloud the entire 
phrase as I present it to you and try to learn 
the underlined words in each of the phrases. 
That is, try to learn the first and last words 
in each phrase. After you have read through 
all six phrases once, I will show you the first 
underlined word of each phrase and ask you 
to tell me from memory what the second 
underlined word is in that phrase. You will not 
be asked to recall the middle word. I am only 
interested in how well you can learn the under- 
lined pairs. 

Each phrase was typed on a blank white index 
card and was presented for a 5-second interval. 
After one trial was completed 3 seconds were 
allowed to elapse while E randomized the order 
and made certain the S understood his task. The 
S was then shown the first word of each phrase 
and asked to respond with the correct missing 
underlined word. 

Subjects. Forty Ss served in this experiment. 
Group 1, consisting of 20 Ss, learned the first list, 
and Group 2, also consisting of 20 Ss, learned the 


second list. 

Results. The results for this experiment 
are presented in Tables 12 and 13. The 
mean number retained for Group 1 was 3.8 
(SD = 1.41). The mean number retained 
for Group 2 was 27 (SD — 1.64). This 
difference in favor of Group 1 is highly 


TABLE 11 


NUMBER ОЕ PAIRS LEARNED IN EXPERIMENT VI 


Series А Series B Series A & B 

Number Number Number 

Retained| Mesi SD Retained | Mean SD Retained | Mean 
Pictures 94 3.9 1.34 92 3.8 1.18 186 3.9 
ee Hou | № 31 1.70 71 3.0 1.33 146 3.0 
p eso | 45 1.9 1.30 52 2.2 1.34 97 2.0 
VE 55 2.3 1.67 56 2.8 1.52 111 2.3 
* 24 24 48 


alysis of vari: 


ance yielded an F ratio of 16.6 (an F of 2.65 required at the .01 level). A q test for the difference 


between the pictures and the abstract nouns was significant (9 = 1.9). The 


Note—A simple А iclded the following results (0 = 0.80 at the .01 level). The difference between the pictures and the concrete 
0. 


between the means У! 
nouns was significant (4. 
difference between the pict T was по sig 
Was significant (4 = 1.0). The d verbs was 

tween the concrete nouns ant 


— 0.90). The difference bets 
4 tures and the verbs was signi! 


ficant (а = 1.6). The difference between concrete nouns and abstract nouns 
nificant difference between the abstract nouns and the verbs (4 = 0.30). The difference 
significant at the .05 level (4 = 0.70). 


16 WILLIAM EPSTEIN, IRVIN ROCK, Ax» CARL B. ZUCKERMAN 


TABLE 12 


ITEM ANALYSIS оғ RESULTS FOR EXPERIMENT VIIa 
The Number of Ss Who Responded Correctly to Each Pair 


Group 1 (N = 20) Group 2 (N = 20) 
No. of No. of 
Material Corr. Resp. Material Corr. Resp: 
cake near road 13 cake but road ? 
box near cloud 11 box late cloud 8 
shoe on lake 12 shoe but lake 8 
bank on roof 13 bank late roof К ui 
lamp in bottle 14 lamp how bottle 
card in sand 13 card how sand 10 
Group 3 (М = 20) Group 4 (N = 20) 
[© 
No. of 09 h 
Material Corr. Resp. Material Corr. RES 
cake—road 8 cake and road i 
box—cloud 8 box and cloud 
shoe—lake 8 shoe without lake 9 
bank—roof 9 bank or roof 7 
lamp—bottle 8 lamp or bottle 9 
card—sand ví card without sand 8 
significant ( = 2.44, significant at the .01 


level). 


Before we accept these findings as evi- 
dence for the hypothesis, we must consider 
а possible objection. It can be argued that 
the Conditions of learning for Group 2 
actually inhibited performance and thus do 
not provide a fair test of our thesis. The 
middle word for Group 2 may have inter- 
fered with the link between the first and 


TABLE 13 


Nu 
MBER OF PAIRS LEARNED IN EXPERIMENT УПа 


№. 
N | Retained Mean SD 

Group 1 20 76 
Group 2 20 54 2 m 
Group 3 20 48 2.4 | {82 
Group 4 20 46 2.3 | 118 


Note.—Group 1 vs. 2:4 = 2.44; signi 
No difference between Group 2 and icant @ the 05 level. 
абет between Group 1 and Gi 
101 level. 


this effect for Group 1. An obvious annd 
would be that for this group the po 
word helped the learner form а unit ^. 
thus enhanced instead of inhibited the unt 
ing. This, however, is precisely the Pig 
To check on the validity of this critic i 
we decided to run a control group У gd 
would learn the same pairs as GrouP ind 
Group 2 without the presence of an}. perst 
of connective between the pair m tiv? 
(Group 3). If the irrelevant conn? сай! 
served to interfere with learning OT т to 
then eliminating the connective OS" gu 
make the control group performant% 
perior to the performance of GrouP amt 
Twenty adults drawn from the ze the 
population as Groups 1 and 2 serve cali 
Ss for this control group. The same “ror % 
ing conditions were maintained exceP” jon. 
change in the wording of the inst“ да 
to eliminate sections which did not 
to this design. 


we 
last words, thus lowering the scores: pave 
may ask why the middle word did not yet 


The mean number of pairs retained was 
| 24. "The standard deviation was 1.32. The 
D difference between the means for the Con- 

trol group and Group 2 was not significant 
(t — .60). 

A. second objection might be raised on 

| the basis of grammatical habits. Although 
the particular three-word sequences do not 

| occur very often, the connectives used in 
the word pairs learned by Group 1 created 
a triad more in accordance with grammatical 

| structure than was the case for Group 2. 

` For example, the phrase "lamp in bottle" 

| is a more natural grammatical sequence 
than "lamp how bottle." Moreover, the 
triads for Group 1 have a greater transi- 
tional probability than those for Group 2; 
some of the latter would represent a zero- 
order approximation to English (cf. Miller 
& Selfridge). Thus, both the grammatical 
habit and sequential probability hypotheses 
predict that the Group 1 material would be 
learned more readily. No reference to 
meaning would be necessary. 

To test this possibility, we repeated the 
experiment with the same word pairs but 
employed different words to connect them. 

. The connectives were chosen so that the 
triads would be as grammatically natural 
and would have the same transitional prob- 
ability as the material used for Group 1. 
However, these connectives would not aid 
the formation of conceptual units. The 
sequence “lamp or bottle" is as good gram- 
matically and is as probable as "lamp in 
bottle." But according to the organization 
viewpoint, the word “ог” does not facilitate 
unit formation. 

Materials and method. The materials 
used with Group 4 are shown in Table 12. 
The procedure was identical with that used 
for Groups 1 and 2. 

Subjects. Group 4 consisted of 20 college 
students. 

Results. 
learned was 2. 
of 1.18. This 
with the mean О 
The difference О 
the .01 level. Fur 
no better than Groups 


The mean number of pairs 
3 with a standard deviation 
result should be compared 
f 3.8 obtained by Group 1. 
f means is significant at 
thermore, Group 4 does 
2 or 3. 
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The experiment gives a decisive answer. 
Whatever role one assigns to grammatical 
habits and transitional probability in verbal 
learning, it is clear that these factors cannot 
fully account for the present results. The 
possibility of unit formation seems to be 
the crucial factor. 


Experiment VIIb 


In this experiment our purpose and de- 
sign was similar to Experiment VIIa, ex- 
cept that pictures were used. Group 1, 
consisting of 20 Ss, learned six pairs of 
outline pictures of familiar objects, e.g., 
card—glass. The pictures were drawn so 
that each pair constituted a unit. Thus, a 
“card in a glass” was presented. See Fig- 
ures 3 and 4 for reproductions of the pic- 
tures used in this experiment.’ Group 2, 
consisting of 20 Ss, learned the same six 


IRS 
Age | 


Fic. 3. Pairs learned by Group 2, Experiment 
VIIb. 


Fic. 4. Pairs learned by Group 1, Experiment 
УПЬ. 


* This technique of joining items into an integral 
unit is analogous to one first employed by Asch, 
Ceraso, and Heimer (1960). 
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pairs of pictures. For this group, however, 
the members of each pair were presented in 
the traditional paired-associate manner. 

The instructions given to both groups 
were: 

You will be shown six pairs of pictures of 
things you have seen often. I want you to try and 
learn these pictures, that is, try and learn which 
pictures are paired together. After we have gone 
through the list once, I will show you one of the 
pair and ask you to tell me verbally what you 
remember the missing partner to be. 

Each pair was presented for 5 seconds, 
and a 30-second interval followed the last 
presentetion. The test order was random- 
ized. 

Results. The results for this experiment 
are presented in Table 14. Group 1 learned 
a mean number of 49 pairs (SD — 1.35). 
Group 2 learned a mean number of 3.3 
pairs (SD = 1.26). This difference is highly 


significant (+ = 3.6, significant at the .01 
level). 


Discussiox 


, Our results show that pairs of familiar 
items are easier to associate than pairs of 
novel ones. Why this should be so is not 
entirely clear but one reason is surely the 
fact that with familiar material the S has 
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less to learn. Since he already knows the 
individual items, he is free to concentrate 
on forming a bond between them. 

In his discussion of the organization 
hypothesis, Kóhler does not mention the 
role of familiarity. It is clear, however, 
that emphasis on meaning is perfectly com- 
patible with acceptance of the importance 
of familiarity as well. The real issue, we 
believe, is whether the effect of meaning- 
fulness in learning is reducible to familiarity 
of any kind. In the series of experiments 
reported above we have attempted to prove 
that such a reduction is not possible. Mean- 
ing appears to be an independent factor. 

Although meaning is a result of past 
learning it enables the S to go beyond what 
he has learned. Similarly, while insight 1” 
problem solving may be based on past ex- 
perience, it is not simply a reproduction of 
the past but represents something new: 
When the .$ knows the meanings of words 
he can “play” with them and put them into 
new combinations. This possibility makes 
them easier to associate than nonsense 
syllables or familiar but less meaningful 
items. When we refer to these combinations 
as conceptual units we do not wish t? 
imply that such units are sensible or logical. 
They are conceivable or imaginable. 


TABLE 14 


RESULTs OF EXPERIMENT УПЬ 


Pairs of Pictures Learned 


Material Group 1 (М = 20) Group 2 (N = 20) 

pipe—clock 

card—glass B 1 : 

chair—tree 18 11 

bottle—wall 16 

pencil—lamp 15 12 

bowl—hand 16 9 
Total 98 56 

=. 4.9 3.3 

ой 1.35 1.26 

tg — 22 


3.6** 


** Significant at .01 level. 


A, 
^ 


MEANING AND FAMILIARITY 


Many investigators, in recent years, have 
perhaps too hastily concluded that the re- 
sults of their experiments make unnecessary 
any reference to the role of meaning as we 
have described it. But as we have already 
stated, evidence in favor of a habit ex- 
planation does not prove that this is the 
only possible explanation. 

For example, according to Noble (1952a) 
"meanings are habits" (p. 429). The greater 
the number of responses conditioned to а 
particular stimulus the more meaningful is 
this stimulus. He advances an operational 
index of stimulus meaning which is the 
"mean frequency of continued written asso- 
ciations made by subjects within a 60 sec. 
time interval" (1952a, p. 429). ^ Noble 
assumes that this index (m value) is an 
important variable in verbal learning. А 
serial learning experiment (1952b) showed 
that m value was positively correlated with 
case of learning. Presumably the facilitative 
effect of high m value would follow from 
the associationist explanation outlined above 
on page 1. 

If we were to apply Noble's т value (or 
Mandler & Huttenlocher's [1956] f value) 
index to our material, it might be assumed 
that concrete nouns would elicit more asso- 
ciations than conjunctions ог abstract nouns. 
Informal tests confirm this assumption. 
'Thus, the superiority of concrete nouns 
might be explained as the outcome of high 
m values. Yet our explanation cannot be 
ruled out, unless one can find a situation 
for which the two hypotheses make different 
predictions. Noble's hypothesis would pre- 
dict that the concrete nouns should be 
learned more readily, not only as paired 
associates but in serial learning as well. 
On the basis of the organization view, we 
would argue that, in the learning of a 
series, the formation of conceptual units 
would be quite difficult, since the words are 
presented in a continuous sequence. There- 


ates clang and tangential associa- 
to the meaning of the stimulus 
dure shows that m value is pa 
identical with meaning; these ее. d 
pendent on SS inowledgé of the man S uu 
word, It is not clear what associations to 

nonsense syllable signify. 


* Noble elimin: 
tions unrelated 
term. This proce 


IN ASSOCIATIVE LEARNING 19 


fore, the concrete nouns might do no better 
than other kinds of material, especially 
when only one or two presentations are 
given. Preliminary data obtained by John 
Catalano? yield the following means for 
the number of items recalled by 23 5$ after 
one learning trial each (using a test of free 
recall) with a list of 12 concrete nouns, a 
list of 12 conjunctions and prepositions, 
and one of abstract nouns: 8.9, 82, and 
8.0, respectively. These results are contrary 
to the m value hypothesis. Further research 
on this problem is in progress. 

To explain the superiority of picture pairs 
over the other kinds of material and the 
superiority of concrete nouns over verbs 
found in Experiment 6, in terms of associa- 
tive value, it would be necessary to obtain 
corresponding differences in т value. This 
seems unlikely. Moreover, the fact that the 
pairs in List A of Experiment 5 are not 
as well learned as noun pairs argues against 
the Mandler and Huttenlocher interpreta- 
tion. In this case, the words have been 
linked together frequently so that no medi- 
ating bridges would be needed to aid in the 
formation of a bond. 

While Noble and Mandler and Hutten- 
locher agree that meaning, as defined by 
them, does benefit learning, Sheffield (1946) 
finds no facilitative effect. This failure may 
be an outcome of the way in which he 
attempted to make a term meaningful. At- 
taching such responses as blacking out the 
word or putting a staple through it does not 
provide the kind of meaning that would 
favor the formation of associations. 

Another factor in the effect of meaning- 
fulness which we have not discussed has 
been suggested by Sheffield. Meaning, as 
an internal response, facilitates learning 
because of its role in making for stimulus 
differentiation. The words "cup" and "cut" 
as sheer sounds are very similar. But the 
unique internal responses attached to each 
differentiates the two items and thus oper- 
ates to reduce interference. However, 
Sheffield's own data do not support this 
hypothesis since meaning did not yield any 
benefit in addition to familiarity. Nor does 


— 
1 Brooklyn College. 


20 


this view account for the differences ob- 
tained in Experiments 6 and 7.5 


Another example where results support- 
ing their hypothesis led the investigators 
to conclude that meaning is not an inde- 
pendent factor is provided by the Miller- 
Selfridge experiment. These authors em- 
phasize their finding that some sequences 
of a third or fourth order of approximation 
are learned as well as text passages. They 
interpret this result as evidence that learn- 
ing depends not on the meaningful nature 
of the sequence but on the presence of 
familiar short range associations between 
words. 

It is interesting to find that this experi- 
ment was repeated with some modification 
by Marks and Jack (1951). Instead of 
testing by means of free recall, they re- 
quired the Ss to reproduce the words in the 
same order in which they had been pre- 
sented during learning. They argue that 
this method provides a better test of the 
Miller and Selfridge hypothesis which 
Stresses short range association. In con- 
tradition to the earlier experiment, Marks 
and Jack found that “recall seems to be 
very much a function of the a priori, rela- 
tive meaningfulness of the materials” (p. 
300). Р 
Intuitively, it did not seem possible to us 
that text passages would not be learned 
more easily than some of the lower orders 
of approximation. Therefore, we selected 
two passages from the Miller and Selfridge 
material which according to their result 
were learned as well as text and repeated 
their experiment with the identical pro- 
cedure. The results for 16 college students 
were: 

i Twenty-word 


We passage, third-order ap- 
proximation—520;, 


recall. 


* Another possible 


findings, which is suggested by Sheffield, is that 
meaningful material 5 ' 

involving inter- 
, "pen—hand" 
to the learner. 
could occur or 


i е, it is not ob- 
vious how the results of Experiments V and VI 
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Twenty-word text passage—78% recall. 
2. Thirty-word, fourth-order approxima- 
tion—58% recall. 


Thirty-word text passage—72% recall. 


We have no explanation for the discrep: 
апсу but our results, together with those 0 
Marks and Jack, suggest that Miller an 
Selfridge have not ruled out meaning аз 2 
factor in verbal learning. 1 
pointed out, in addition, that their own 
data might be explained in terms of “sho 
range meanings" as well as short rang? 
associations. d 

One final example : grammatical sequis 
of words can be learned better than ran pe 
sequences of the same length. This € 
a result of transfer from previously esta? 
lished grammatical habits. However: 
grammatical sequence is usually meaning 
(of course in a more complex sense к. 
is the case for an isolated word) an 
role of meaning may be independent ^ 
grammatical habits. Again, evidence in SÊ 
port of one factor does not eliminate ss 
other. Experiments analogous to the 9 2. 
reported in this paper could perhaps 255 je: 
the separate effects of each. For exam 0 
$5 might be given the task of learnin£ 
passages equally well structured 
matically but differing in meaningf u!” 
(Perhaps a passage from Hegel ап 
from Bertrand Russell?) ей 

ТЕ is important to clarify the sens" in 
which we use the term “organizatiO? „ye 
connection with Kéhler’s hypothesis: itey 
would agree with Postman an ind’ 
(1957) that in the case of verbal lear? er 
the term "organization" as it is used 11 o? 
ception, with its implications relatin£ we 
specific brain model, is inappropriat fet 
use it in the present context only © anle 
descriptively to the possibility which Е ds it 


discusses, namely, that the learner ай“ 
easy to relate meaningful items to. 05 в № 
other, to imagine the component it€ ай 


some relation ог juxtaposition to ont dee 
other. Although this is intuitively U^, to 
standable, further research is require ag 
clarify why it is easier to organize 
ingful items than other types of item* 


It should be | 
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SUMMARY 


The roles of familiarity and meaning in 
associative learning were investigated. 

Experiment I revealed the effect of 
familiarity: prefamiliarized nonsense sylla- 
bles were learned better than novel ones. 

In Experiment П a group of Ss learned 
three lists of six pairs of words: a list 
of nouns, a list of three conjunction pairs 
and three preposition pairs, and a list of 
nonsense syllable pairs. The nouns were 
considered to be meaningful and familiar, 
conjunctions and prepositions primarily 
familiar, and syllables neither familiar nor 
meaningful. The results showed that nouns 
were better learned than conjunctions and 
prepositions, and these in turn were better 
learned than the syllables. 

Experiment III was an attempt to localize 
the influence of meaning and familiarity. 
Experiment II, therefore, was repeated, this 
time using matching instead of reproduction 
to test learning. Although the absolute level 
of performance was enhanced as compared 
with Experiment I, the relative differences 
between materials remained the same. We 
concluded that in our experiments meaning 
and familiarity operated almost entirely to 
facilitate the formation of the associations. 
Experiment IV was designed to deter- 
mine whether the advantage of conjunc- 
tions and prepositions in Experiments II 
and III is due to familiarity or to limited 
Prefamiliarized nonsense sylla- 
bles were learned as well as conjunctions 
and prepositions, showing that familiarity 
was the crucial factor. 

Experiment V provided evidence relating 
to the role of sequential probability in our 
experiments. The results indicated that 
while this factor does facilitate learning to 
some extent, it is not sufficient to account 
for our earlier findings. . 

In Experiment VI we tested the possi- 
bility that meaningful material allows for 


meaning. 


the formation of conceptualizable units 
which combine the separate items into a 
new whole. The Ss learned four lists com- 
posed of six pairs each: a list of pictures of 
real objects, a list of concrete nouns, a list 
of verbs, and a list of abstract nouns. We 
assume that these varied in respect to the 
readiness with which they could be formed 
into novel units. The results showed that 
pictures were learned better than concrete 
nouns which in turn were superior to verbs 
and abstract nouns. The verbs and abstract 
nouns did not differ significantly. 

In Experiment VIIa we investigated fur- 
ther the hypothesis that the conceptualiza- 
tion of objects is facilitating because it 
encourages the formation of units. Two 
groups of Ss learned six pairs of concrete 
nouns. Group 1 learned the pairs as parts 
of a three-word phrase in which the middle 
word was an intelligible connective under 
conditions which encouraged unit forma- 
tion. Group 2 learned the same pairs as 
parts of a three-word phrase where the 
middle word did not reasonably connect the 
two members of the pair. Performance for 
Group 1 was superior to Group 2. A third 
group served as a control. This group 
learned the same six pairs but with no con- 
nective whatsoever. The performance of 
this group was not significantly different 
from Group 2. А fourth group learned 
phrases which formed sound grammatical 
units but did not form good conceptual 
units. This group was not significantly dif- 
ferent from the control. 

In Experiment VIIb we used six pairs 
of pictures of familiar objects. For Group 1 
the two pictures of each pair were presented 
as one unit. For Group 2 the pairs were 
presented in the traditional paired-associate 
manner. Group 1 was highly superior to 
Group 2. 

These experiments were considered in 
their relation to the explanations offered to 
account for the role of meaningfulness. 


22 WILLIAM EPSTEIN, IRVIN ROCK, ann CARL В. ZUCKERMAN 


з 


REFERENCES i 


AscH, S. E., Ceraso, J., & Hermer, W. Per- 
ceptual conditions of association. Psychol. 
Monogr., 1960, 74, (3, Whole No. 490). 

Сномзку, N. Syntactic structures. The Hague: 
Mouton & Co., 1957. 


GLAZE, J. A. The association value of nonsense 
syllables. J. genet. Psychol., 1928, 35, 255-267. 


Frs, С. F. The structure of English. New York: 
Harcourt, Brace, 1952, 


Номамр, C, & Kurtz, К. Н. Experimental 
studies in rote-learning theory: X prelearning 
syllable familiarization and the length-difficulty 
relationship. J. exp. Psychol., 1952, 44, 31-39. 

Кбншв, W. Gestalt psychology. New York: 
Liveright, 1929. 


Контев, W. Dynamics in psychology. New York: 
Liveright, 1940. 


MANDLER, G, & HurrENLocHrm, J. The relation- 
ship between associative frequency, associative 
ability and paired associate learning. Amer. J. 
Psychol., 1956, 69, 424—428. 

Marks, M. R, & Jacx, O. Verbal context and 
memory span for meaningful material. Amer. J. 
Psychol., 1952, 65, 298-300. 

McGeocn, J. A, & IRION, А. L. The psychology 
of human learning. New York: Longmans, 1952. 

Miter, С. A., & SELFRIDGE, J. А. Verbal context 


and the recall of meaningful material. Amer. J. 
Psychol., 1950, 63, 176-185, 


Мове, С. E. An analysis of meaning. Psychol. | 
Rev., 1952, 59, 421-430. (a) 

Noste, C. E. The role of stimulus meaning (m) 
in serial verbal learning. J. exp. Psychol, 19 
43, 437—446. (b) i | 

Озсоор, C. E. Method and theory їп experimento 
psychology. New York: Oxford Univer. Press, 
1953. 

Оѕсоор, С. Е. A behavioristic analysis of percep - 
tion and language as cognitive phenomena. 1 
Contemporary approaches to cognition: A reh 0, 
of a symposium at the University of Color! d 
Мау 12-14, 1955. Cambridge, Mass.: H 
Univer. Press, 1957. е o 

Postman, L, & Rumey, D. A. А criti or 
Kóhler's association theory. Psychol. Rev» 
64, 61-72. 

ROBINSON, E. S. Association theory today. 
York: Appleton-Century, 1932. 2 of 

SHEFFIELD, Е. D. The role of meaningfulness jp- 
stimulus and response in verbal learning: jvef 
published doctoral dissertation, Yale Un 
1946. е 

THORNDIKE, Е. L., & Lorce, I, The teachers 
book of 30,000 words, New York: Bure? 
Publications, Teachers College, 1944. ab 

Waters, R. H. The law of acquaintance. 
Psychol., 1939, 24, 180-191. 


New 


word 
u of 


(Received March 20, 1958) 


/————————————— _— 


| 
| 


| Vol. 74, No. 5 


Whole No. 492, 1960 


Psychological Monographs: General and Applied 


RELATIVE EFFECTIVENESS OF 
PSYCHOTHERAPEUTIC PROGRAMS: 
A MULTICRITERIA COMPARISON OF FOUR PROGRAMS 


FOR THREE DIFFERENT PATIENT GROUPS? 


С. W. FAIRWEATHER,? R. 
J. L. HOLLAND,! R. $ 


SIMON; M. E. GEBHARD, E. WEINGARTEN, 


G. B. STONE, and J. E. REAHL 


Veterans Administration Hospital, Perry Point, Maryland 


SYCHOTHERAPISTS in neuropsychiatric 

hospitals are continually faced with the 
problem of formulating feasible and eco- 
nomical psychotherapeutic programs for a 
heterogeneous population of patients. His- 
torically, such programs have usually been 
established upon the theoretical propositions 
accepted by those in positions of authority 
and, frequently, little attempt has been made 
to evaluate their effectiveness. In addition, 
when such evaluations have been carried 
out, a single criterion is selected and its 
relationship to other critieria has been 
ignored. The present study attempts to 
compare the relative efficiency of four com- 
mon psychotherapeutic programs for three 
patient groups using a number of diverse 
inventories, projective devices, and rating 
scales. 

Probably the most common treatment 
program in neuropsychiatric hospitals today 
provides each patient with an individual 
work assignment and planning for post- 
hospital adjustment. This program is based 
upon the assumption that the hospital work 
situation can be therapeutic. For that rea- 


1 The authors wish to express their appreciation 
to the entire staff of VA hospital, Perry Point, 
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Sarah Shifllet, Anna Wirsing, Grace Pruitt, and 

ms nursing assistants, and Mary Mc- 


James Sprouse, r 
e nurse, who served on the experimental 
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od Boyd, uh Sal ui, L. Arlene "eh 
Marian Barry, and Lucy Yoshizawa. Ed Rn 
Jefferys, Marge Edelson, and John Моав we 
express gratitude for their social work services. 

€ are especially indebted to Earl Brannon, 


son, the hospital personnel attempt to place 
the patient in a work situation suited to his 
unique skills and needs. This opportunity 
for maximal use of his abilities in a job 
setting may enhance the patient's self- 
esteem and result in more adaptive behavior 
in the extramural setting. 

A second common treatment program em- 
ploys individual work assignments, individ- 
ual psychotherapy, and departure planning. 
This assumes that underlying conflicts must 
be resolved before any effective personality 
change can occur and, further, that per- 
sonality change is essential for extramural 
adjustment. It assumes too that the rela- 
tionship between the psychotherapist and 
patient is the most effective instrument for 
resolving unconscious conflicts. A host of 
publications and documents signify that 
individual psychotherapy has, indeed, be- 
come one of the most common psycho- 
therapeutic approaches of our time. 

A third treatment program uses individ- 
ual work assignments, group psychotherapy, 
and departure planning. It differs from the 
two preceding programs only with respect 


Manager, and Willy Oppler, Chief of Professional 
Services, for their administrative assistance in 
carrying out this project. We are grateful to 
Roy Е. Buehler for his contributions to the theo- 
retical aspect and Laura Kent for her aid in 
preparation of the manuscript. 

* Now with VA Hospital, Palo Alto, California. 

3 Now with National Institutes of Health, 
Bethesda, Maryland. 

! Now with National Merit Scholarship Cor- 
poration, Evanston, Illinois. 

5 Now with Philadelphia State Hospital, Phila- 
delphia, Pennsylvania. 


2 FAIRWEATHER, SIMON, GEBHARD, WEINGARTEN, HOLLAND, SANDERS, STONE, REANL 


to group psychotherapy. Proponents of 
group psychotherapy, as the medium for 
personality change, maintain that, since 
man has learned his maladaptive responses 
in a social milieu, the most effective vehicle 
for changing such responses is group psy- 
chotherapy. In this program the situation 
for personality change is the pattern of 
interpersonal relationships occurring in 
group therapy. 

The fourth program, albeit used less fre- 
quently than the preceding three, is a social 
interaction, or “group living therapy.” This 
rests upon the proposition that the total life 
situation of the patient can be used as an 
instrumentality for personality change. 
Therefore, the patients live and work to- 
gether, and, as an integral part of this pro- 
gram, participate in group psychotherapy. 
This program differs from the other three 
in that all its functions are group oriented. 
The rationale is that changes in maladaptive 
perception and subsequent behaviors are 
best effected by planned social situations 
which provide an opportunity for correct- 
ing inappropriate interpersonal responses. 
Each situation provides an atmosphere for 
group decision and responsibility conducive 
to group cohesiveness. It is the cohesive- 
ness of the group which mirrors the ac- 
cepted norms of the society and provides 
the background for corrective emotional 
experiences. In this atmosphere, all group 
activities are believed to be potentially 
therapeutic. 

In general, psychotherapeutic programs 
such as those described have been more 
successful with neurotic and acute psychotic 
patients than with chronic psychotic pa- 
tients. 

F rom the aforementioned experience and 
rationale two general hypotheses evolve. 
They are: 


1 "Treatment programs involving psycho- 
therapy effect more change in the direction 
of adaptability than work programs alone. 

2. Nonpsychotics and short-term psy- 
chotics change more in the direction of 
greater adaptability with any treatment 
program than long-term psychotics, 


b 
PROCEDURE 

The experiment took place on a ward in | 

a large Veterans Administration neuro | 
psychiatrie hospital. The number of patients 
on the ward at a time varied from 40 to 9^ 
The ward staff included а psychiatrist, ® 
psychologists, à social worker, а psychologt 
cal intern, a nurse, and four nursing assist- 
ants. As for its physical features, there 
were three large dormitories, а recreation 
ig and visitors room, and @ 
пе stu y 


room, a reading 

television room. АП patients in the. E 
: { 3 Wa. 

remained on this ward for the duratto! 


their treatment. А T 
The general ward program for 47 u, 


tients in the experiment included respons 
bility for ward housekeeping and, "els 
tion, two recreational evenings per Me. 
The first of these took place on Sunday 2 
Monday and included а structured Te 

tional situation. During these рей 
member of the special services divis" 
the hospital, along with a group of № nne 
volunteers from the community. P 
recreational situations in advance ere 
dances, table games, etc. The second TE os; 
ational situation, on Thursday even еп 
was an unstructured situation: the \ ү 
volunteers arrived on the ward ant 
gether with the patients, planned 
tional activities for the evening. "ugly 
informal groups were formed spontane? od | 
They played cards, danced. and ene atl 
in similar activities. 


ods а 


ion 


In addition, the |, 12 
recreation rooms were open from | 
in the morning and from 3 to 90 e 
the evening. When the patients MC 
participating in their specific treatment | of 
grams, they could watch TV, play Dae 
cards, read, and engage in other acl of 
During this time a nursing assistant "^" 
duty. 


ck 


The Four Experimental Conditions 


In the first treatment program, = Tat 
team? provided each patient with an nos? 
work assignment and planning for PO 


9 Consisting of а psychologist, 
counseling psychologist, social worker, 2 
logical intern. Any further use of the, а. 
habilitation team" refers to these individu? 
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living. This group constitutes the control group 
for the study and will be referred to as Group C. 
For the second group (Group Г), the treatment 
program consisted of an individual work assign- 
ment, individual psychotherapy, and departure 
planning outlined by the rehabilitation team. The 
treatment program for the third group (Group G) 
included an individual work assignment, group 
psychotherapy, and departure planning by the re- 
habilitation team. For the fourth group (Group 
GG), the treatment consisted of a group work 
situation, group living situation, and group psycho- 
therapy. 

Upon their arrival on the ward, members of the 
C, G, and I groups were interviewed by a counsel- 
ing psychologist and given individual work assign- 
ments, In addition, the G and I groups were given 
therapy appointments. Groups C, G, and I were 
responsible for ward housekeeping in all dormi- 
tories except the one occupied by the GG group. 
For these three groups, each patient was assigned 
to ward housekeeping {ог a period of two weeks. 
This was done on a rotating basis. 

The GG group lived together in а single ward 
dormitory. Their work assignment was to produce 
a hospital newspaper every two weeks. Each 
afternoon at 1 o'clock they reported to a room 
which had been set aside as the newspaper work- 
room. Here, under the supervision of a staff mem- 
ber, they had the total responsibility for running 
the newspaper: they made decisions about the 
content of the paper, the reporting of the articles, 
its reproduction, and all other matters relating to 
the administration and design of the paper. The 
first hour of each afternoon was devoted to a 
discussion of their work. In addition, they re- 
ceived group therapy and were responsible as а 
group for the housekeeping in their dormitory. 


Therapists and Therapy 


Two staff psychologists served as ther- 
apists for the patients in the G and GG 
groups. One of these had four years of 
postdoctoral psychotherapeutic experience 
and the other had two years. The 48 pa- 
tients in these two groups were divided 
evenly between the therapists. 

The individual therapy patients (Group 
Г) were carried by 15 psychotherapists with 
a wide range of atr Five clinical 

‚ staff members, whose range о 
He pe varied. from two to five post- 
doctoral years, provided individual psycho- 
therapy. One staff psychiatrist, with six 
years of psychotherapeutie experience, x 
ried an individual therapy patient. In addi- 
tion, four clinical psychology traces: ex d 
four psychiatric residents provided individ- 


ual psychotherapy. All were supervised by 
consultants. 

Both the group psychotherapy and in- 
dividual psychotherapy sessions were psy- 
choanalytically oriented. Group sessions 
were held for 1—1/2 hours two times 
weekly and individual therapy sessions 
varied from two to four per week depend- 
ing upon the judgment of the psychother- 
apist. Groups averaged six patients and 
ranged from four to eight. The groups were 
open groups and usually consisted of neu- 
rotics, acute psychotics, and chronic psy- 
chotics. 


Criterion for Discharge or Failure 


At the weekly meeting of the rehabilita- 
tion team, each patient's current hospital 
adjustment was discussed and his readiness 
for discharge was evaluated. No patient 
left the treatment program unless it had 
been unanimously agreed that he was ready 
for discharge. In addition to the 85 pa- 
tients discharged, there were 11 who were 
considered failures. The criterion for fail- 
ure was the team’s decision that the patient 
had shown no progress, or had shown a 
decrement in his general adjustment after 
six months of treatment. Such patients 
were considered failures and were sent from 
the experimental ward to other hospital 
wards where a more appropriate treatment 
program was possible. 


Sampling Procedures 


When the program started, each patient coming 
to the ward was randomly assigned to one of the 
four treatment groups. After a sufficient sample 
in each group had been accumulated by random 
assignment, new patients were placed by matching 
across treatments on age, diagnosis, and length of 
hospitalization. When new patients who did not 
meet the matching criteria arrived on the ward, 
they were randomly assigned to one of the four 
treatment groups. Thus, over a period of time a 
treatments by diagnoses factorial analysis of vari- 
ance design was attained with 24 patients in each 
of the four treatment groups and 32 patients in 
each of the three diagnostic groups. 

The four treatments have been discussed. The 
three diagnostic groups were: (a) nonpsychotic, 
(b) short-term psychotic, and (c) long-term 


4 FAIRWEATHER, SIMON, GEBHARD, WEINGARTEN, HOLLAND, SANDERS, STONE, REAHL 


psychotic." Each patient in the nonpsychotic group 
had been diagnosed a nonpsychotic. Each patient 
in the short-term psychotic group had less than 
one year of previous hospitalization in addition to 
his psychotic diagnosis. Each Patient in the long- 
term psychotic group had a psychotic diagnosis 
and previous hospitalization exceeding one year. 

A descriptive comparison of the treatment and 
diagnostic groups appears in Table 1 and 2, 
respectively. 

Table 1 indicates that the treatment groups are 
quite homogeneous with respect to diagnosis, 
amount of drugs received, age, education, race, 
length of previous hospitalization, pensions for 
NP illness, age of first hospitalization, employment 
history, occupational Status, and marital status. 

On the other hand, Table 2 reveals significant 
difference among the three diagnostic groups. The 
stratifying variable of previous weeks of NP 


T Diagnostic groups are listed in the tables as 
follows: nonpsychotics (N), short-term psychotics 
(Ps), and long-term psychotics (РІ). 


hospitalization significantly differentiates the three 
diagnostic groups as do the pension, employment, 
and marital status variables. No significant differ- 
ences are found for age, education, race, age 0 
first NP hospitalization, weekly salary, occupa- 
tional status, or percentage of patients on tram 
quilizing medications. Viewed collectively, the йч 
ferences suggest that the long-term psychotic 
more maladaptive than either the nonpsychotics 0 
short-term psychotics. 


Experimental Instruments and 
Testing Situation 


In selecting criteria for evaluating change, 0 
experimenters attempted to include measures om- 
were frequently used for this purpose an areas 
passed a broad range of behavior. „пея 
selected for assessment included individu? 
havior, self-evaluation, fantasy, and 
Specifically the instruments were: Ward Ве eren 
MMPI, Q sort, Holland Vocational Pret ing 
Inventory (HVPI), TAT and a Follow-up 


TABLE 1 


COMPARISON ОЕ THE Four TREATMENT GROUPS ON DEMOGRAPHIC CHARACTERISTICS 


Test die 
Variable GG G I C | Sig 
Matching Variables 
Diagnoses 
% character disorders 12.5 12.5 12.5 12.8 0 
% neurotics 20.8 20.8 20.8 20.8 х= 
% psychotics 66.7 66.7 66.7 66.7 
Mean age 7 36.25 33.17 34.21 34.92 т 
ean weeks of prior NP hospitalization 51.6 49.2 53.8 54.8 кеч 
N — Variable 1.58 
о receiving tranquilizing drugs 54 58 кт 
Demographic Variables Á Ы - 
Education 
Mean grade completed Ler 
б plete 10.5 10.6 11.0 11.2 
Z% white 79 75 75 75 "E i 
v, _® Negro 21 25 25 25 1.52 
% pensioned for NP illness 67 50 67 58 mee 
Mean age of first NP hospitalization 28.5 27 29 31 е 
Employment 
Ma % se employed in past 10 years | 71.5 75 68 71 Dee 
ean weekly salary of last job ES a 
Occupational status TERIS 8 Ре "a S 
% professional-managerial 12.5 2 8 
% skilled trades, sales, clerical 33.3 P zd 8 a x? a) 
% unskilled workers 54.2 58.4 54.2 62.5 
Marital Status ` 2 у : 
% currently married 25 3 5 2 8 
% single or divorced 75 a - d 


в The F ratio value. . 


- indicates that the within-group variance was greater than the between-group variance. 
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TABLE 2 


COMPARISON OF THE THREE DIAGNOSTIC GROUPS ON DEMOGRAPHIC CHARACTERISTICS 


3 Test of 
Variable N PS PL Significance 
Matching Variables 
Mean age 36.09 34.41 33.31 F=.. 2 
Mean weeks of prior NP hospitalization 25.34 20.47 111.22 Е = 45.91** 
Concurrent Variables 
% receiving tranquilizing drugs 37.5 56.3 65.6 x? = 527 
Demographic Variables 
Education 
Mean grade completed 11.03 10.34 11.09 I mi. sn 
Race 
% white 81.3 65.6 81.3 j 
% Negro 18.7 34.4 18.7 х REE 
% pensioned for NP illness 37.5 59.4 90.6 x? = 19.46** 
Mean age of first NP hospitalization 31.19 29.75 25.88 F = 3.06 
Employment 
Mean % time employed in past 10 years 19.22 83.50 51.09 Е = 17.02** 
Mean weekly salary of last job 68.06 73.12 56.13 F = 2.37 
Occupational status 
% professional-managerial 12.5 21.9 21.9 
% skilled trades, sales, clerical 34.4 18.8 18.8 x? = 3.32 
% unskilled workers 53.1 59.3 59.3 
Marital status 
% currently married 28.1 46.9 6.2 x? = 13.39* 
% single or divorced 74.9 53.1 93.8 T MS 
a The F ratio value . . . indicates that the within-group variance was greater than the between-group variance. 


*.01 level of significance. 
**,001 level of significance. 


Scale.8 Their construction and scoring will be 
discussed in subsequent sections devoted to the 
findings obtained with each assessment device. 
Every patient in the study was tested twice with 
the instruments listed above. The first assessment 
was made at the time the patient transferred to 
the experimental ward and the second shortly be- 
fore he left the ward. The tests were administered 
during scheduled sessions in an office building 
which was located some distance from the ward. 
The same examiner administered both initial and 
terminal tests. The initial testing occurred, with 
a few exceptions, on the first Monday afternoon 
and Tuesday morning following a patient's trans- 
fer to the experimental ward. At that time the 
patient Was instructed by the ward nurse to report 


ior Scale, Q sort, and Follow- 
ited with the American 
up Scale have been deposite 
Documentation Institute. Order Document No. 
6239 from the A.D.I. Auxiliary Publication Proj- 
ect, Photoduplication Service, Library of Congress, 
Washington 25, D. C., remitting in advance = 
for photoprints or $1.75 for 35 mm. microfilm. 
Make checks payable to Chief, Photoduplication 


Service, Library of Congress. 


з The Ward Behav: 


to the office building "for testing." In a few cases, 
due to uncontrollable circumstances such as illness, 
the time interval between transfer *o the ward and 
testing was somewhat longer, though it never 
exceeded two weeks. The second administration 
usually was scheduled for the Monday afternoon 
and Tuesday morning of the week during which 
the patient was to leave the experimental program. 
Here again there were occasional exceptions. 

The tests were administered to groups which 
averaged three or four patients and ranged from 
two to nine. Groups undergoing initial and ter- 
minal testing worked in different rooms. 

The following instructions were given for the 
initial testing : 

All of you have recently been transferred 
to Ward 19A from other parts of the hospital. 
Part of the routine procedure connected with 
transfer to 19A is some psychological testing 
and this is why you were asked to come over 
here today. The purpose of these tests is to 
provide information about you which will be of 
help to your doctors on the ward in helping 
you. It is therefore in your interest to cooperate 
to the best of your ability in doing these things 
which you will be asked to do, and I hope you 
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will do so. Now, some of you may have done 
some of these things before, either when you 
first came to Perry Point, or at some other time. 
But, even so, even if you have done them only 
a few days ago we would like you to do them 
again, since it is important that we get the most 
accurate picture that we can of you as you are 
now, as you are entering 19A. There are several 
things which you will be asked to do and these 
will take, probably, all of the afternoon and part 
of tomorrow morning. Unless there are any 
questions, we will begin. 


The examiner then distributed the Q sort cards 
and place cards, drawing the most-like-6 and 
least-like-6 cards out of the stack and placing 
them separately in front of the patient. He then 
proceeded with the instructions as follows: 

I have just placed before each of you a stack 
of cards, each of which has a typewritten state- 
ment on it. What I would like you to do is 
to sort these cards according to the way the 
Statements apply or don’t apply to you. The 
way I want you to start is to go through the 
whole stack, reading all the statements. Out of 
these cards you are to select the six which are 
most like you the way you are, which are most 
true of you, and to put them on the card which 
Says most-like-6. Also select the six statements 
which are least like you the way you are, which 
are least true of you, and put them on the card 
marked least-like-6. Now remember, in doing 


this you are supposed to go through the whole 
Stack of cards. 


After a patient had completed the first sort, 
the examiner counted the cards he had placed 
in each pile, attached them and placed them in 
& separate envelope for each patient. Then he 


drew the most-like-9 and least-like-9 cards from 
the stack and said : 


Now go through the remainder of the cards 
and, out of all the cards you have left, select 
the nine statements which are most like you and 
the nine which are least like you and put them 


on these cards, Again, go through all the cards 
you have left. 


This sorting was repeated twice more with 15 
and 20 cards each. 


After a patient had completed the Q sort and 
the examiner had checked the numbers in each 
pile, an HVPI booklet and answer sheet were 
placed before him and the examiner said: 

The next thing you will do is to go through 
this list of jobs or occupations and indicate on 
the answer sheet your first impression as to 
whether a particular job or occupation appeals 
to you. If it does appeal to you, black in between 
the dotted lines under the column with the head- 
ing A. A stands for appeal If it does not, 
black it in under heading D. Just go on the 
basis of your first impression, without spending 
much time thinking about any one item, Put 
your name in this space before you begin. 


After a patient had completed this test the 
examiner put before him ап MMPI booklet and 
answer sheet saying: А 

This is the last thing which we will do this 
afternoon. This booklet contains a number of 
statements and I want you to indicate whether 
you believe them to be true or false of you by 
making a mark under the column headed T ! 
true, F if false. 

At the end of the afternoon session, the patients 
were requested to return the following morning: 
At that time the examiner again distributed the 
Q sort material and said: 

You remember that the first thing we did yes- 
terday afternoon was to sort these cards into 
different piles according to whether the sto d 
ments on them applied to you or didn't. TE 
morning I would like you to sort them i. 
but on a different principle. Instead of eorth 
them according to the way you are, you n 
sort them according to the way you would 1 
to be. This means that, in the first step, ld 
should pick out the six statements that you bie 
most like to be like, that you would most like 
be true of you, regardless of whether they ar 
actually true of you or not, and the six that У 
would least like to be true of you regardles 
of whether they are true of you or not. i 
example, let's assume that one of the statemen 
is, "I have a million dollars.” Yesterday УЗ 
would have put that in one of the least-like pie 
since that isn't true of you. Today you mig" 
possibly put it in one of the most-like piles since 
although it isn't true of you, you might W! E 
that it were true of you. Now, are you all cleat 
on this difference between today’s and yeto 
day's sort? Again, you are to go through f" 
whole stack each time you make a sort. 

After completion of the initial step the patiens 
were asked to sort 9, 15, and 20 statements €? а 
into most-like and least-like piles, The ргосед И" 
was the same as in the initial sort, 


чө» jal 
The examiner then distributed the TAT materi 
and said: 


This is the last thing you will be asked t 
I have given you each a series of pictures W a 
show one or more persons in some kind Bias 
setting. What you are to do in this is to WÎ 
а story about each picture, а story which echt 
in it the answers to these questions: What mE 
be happening? What might be going on? bor 
might the people be thinking and feeling? wh t- 
might have led up to this? What might the Pu 
come be? You should not spend more that yt 
minutes on any one story after you have йош 
of something. Please put your name on 9 
sheet of paper you use. 


After a patient had completed the TAT he 
dismissed. 


do. 
o 
hich 


was 


tration 


0 p 
dis- 


The instructions for the second adminis | 
varied, depending upon whether the group 
examined consisted entirely of prospective 
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charges or whether it included "failures" who were 
about to be transferred to other wards. In the 
former case the testing session was introduced as 
follows : я 

You may remember that when you first came 
to 19A you came here and were given several 
tests. These were part of the routine procedure 
connected with transfer to that ward. Now you 
are about to leave the hospital and you will be 
asked to take these tests again. "They are part 
of the routine procedure connected with the dis- 
charge process. Again I would like you to 
cooperate to the best of your ability. 

If the group included "failures" the instructions 
were: 

When you first came to 19А you came here 
and were given several tests. These were part 
of the routine procedure connected with transfer 
to that ward. It is part of our procedure also, 
that these tests be taken twice, once upon enter- 
ing the ward and, the second time, after you 
have been on the ward a certain length of time. 
This will be the last time you will be asked 
to take them. Again I would like you to cooper- 
ate to the best of your ability. 

In other respects the procedure W: 
that in the initial administrations. 
All initial and terminal test data were filed 
following their completion. The experimenters did 
not see the data until after the last subject had 


completed the program. 


as the same as 


Attrition Group 


During the 24-month 1 
course of the experiment, 25 patients were 
dropped from the experimental program. 
These failed to complete the program for 
one of the following reasons: (a) required 
major surgery; (b) discharged with less 
than 21 days in the experimental program; 
(c) left the hospital without permission ; 


period covering the 


TABLE 3 


REASONS FOR ATTRITION ву EXPERIMENTAL GROUP 


TABLE 4 


MEAN Days IN TREATMENT 


GG G I с Меап 
М 119.5 65.4 | 117.8 86.0 97.18 
Ps 78.6 | 61.4 | 129.8 | 89.5 | 89.83 
Pl 172.3 | 163.9 | 191.1 92.5 | 154.95 
Mean 123.47| 96.90| 146.23| 89.33 


(d) developed more acute psychotic symp- 
toms requiring transfer to a closed ward; 
(e) refused to cooperate in posttesting. 
Table 3 summarizes the reasons for attri- 
tion by experimental group. 


Time in Treatment 


Since the purpose of this investigation is 
to determine the efficacy and economy of 
the various treatment programs for differ- 
ent diagnostic groups, the amount of time 
spent in treatment, an important consider- 
ation in evaluating the efficacy and economy 
of treatment, was allowed to vary. Table 4 
presents the mean time, in days, for the 
different groups and Table 5 shows the 
analysis of variance for the same data. 
Table 5 indicates that significant differ- 
ences obtain for treatments and diagnoses 
but not for their interaction. 

Table 4 reveals that patients who received 
individual psychotherapy remained in the 
hospital longer than the other three treat- 
ment groups. А further glance at Table 4 
shows that the highly significant difference 


TABLE 5 


AwALYsIS OF VARIANCE OF TIME IN TREATMENT 


GG| G I G 
Required major surgery 1 2 
Discharged in less than ; 
21 days # 1 
Developed more acute | : à 
psychotic symptoms н i 3 
Left without permission 2 1 
Refused posttesting 
T 4 4 |10 


"Total 


حا الا اب ы‏ 


Mean 

Source of Variation df Square F 
Treatments 3 | 16,228.37 | 3.98* 
Diagnoses 2 | 40,714.00 | 9.99** 
Interaction: 
Treatment X Diagnoses | 6 5,451.67 | 1.34 
Within groups 84 | 4,073.69 
Total 95 


* Significant at .05 level. 
** Significant at .001 level. 
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between diagnostic groups results from the 
much longer treatment period of the long- 
term psychotic group when compared with 
the other two diagnostic groups. Table 4 
also reveals that the long-term psychotic 
control group remained in treatment a 
shorter period of time than long-term 
psychotics in psychotherapy. 


REsuLTS 


The results for each experimental instru- 
ment are presented in the following sec- 
tions.” Change is evaluated by: (a) com- 
parison of change score'® variances for the 
12 experimental groups and (b) comparison 
of differences between median change scores 
for treatments, diagnoses, and their inter- 
action. 


MMPI 


The MMPI, like all experimental instru- 
ments, was administered when the patient 
arrived on the experimental ward and when 
he left the ward. In addition to the stand- 
ard scales, the MMPI was scored for 
Barron's (1953) Ego Strength Scale (Es), 
Welsh's (1956) Anxiety (4) and Repres- 
sion (R) scales, and Simon's (1952) Scale 
of Inner Maladjustment (SIM). 

A comparison of the change score vari- 
ances for the experimental groups on each 
scale was accomplished by use of Bartlett’s 
test (Edwards, 1950) 11 

Heterogeneity of variance obtained on 
the Р, К, Mf, Pa, Pt, Sc, Es, SIM, R, and 
А scales. Table 6 presents the mean vari- 
ances for the 12 groups as well as the treat- 
ment and diagnostic groups. 

A comparison of the mean variance for 
treatments, diagnoses, and their interaction 
is contained in Table 7. 


“There were no significant differences (.05 
level) among the 12 experimental groups at initial 
testing on any of the assessment devices, 

10 Change scores refer to initial minus terminal 
test scores. Further use of the term "change score” 
refers to this difference, 

11 The change score variances for the 12 ex- 
perimental groups, together with their tests of 
significance, on each scale presented in this study 
тау be obtained from the authors. 


TABLE 6 


EXPERIMENTAL 
FOR THE 


MEAN VARIANCES FOR THE 12 
GROUPS ON CHANGE SCOR 
16 MMPI Scar. 


GG G I C Mean 
28 
N 148.5 128.50| 91.08 ид. 
Ps 152; 216.68| 05.89 179. 
PI [ 189.72| 194.88) 231. 
Меап 217. 178.32| 117.28 
5 for 


Table 7 shows that the mean variances W 
the 16 MMPI scales are significantly differ. 
ent between treatments, diagnoses, and Mw. 
interaction. Table 6 reveals that all e. 
ment groups have larger mean variance к 
the control with the group living treatme 
showing the greatest mean variance. 
the diagnostic groups, short-term psy ae 
exhibit greater mean variance than ДН 
psychotics and the long-term р5у© p 
group shows the greatest variance. 
higher variances in the short-term psy 
treatment groups compared with cont" e 
seems to account for the majority © 
interaction variance. dian 
Table 8 presents a comparison of me d 
change scores by treatments, diagnose? ^, — 
their interaction. The treatment group со е 
parisons and the interaction effects we 
evaluated by Friedman's test (Wilco? % 
1949). The diagnostic group compar! еї, 
were made Бу the median test (Sie 
1956). In these analyses, the only sig at 
cant directional change between treat! eG 
groups occurred on the Ma scale. Here 


hotic? — 


TABLE 7 


ES 
CoMPARISON ОЕ TREATMENTS AND DiAGNO? 


ON 16 MMPI SCALE VARIANCES 


Between Between Interaction: qos 
Treatments | Diagnoses | Treatment X Diag 
x (3 df) | xeQ af) х2 (6 df) 
10.83* | 15.154» 19.38** 


* Significant at the .05 level. 
** Significant at the .01 level. 
*** Significant at the .001 level. 
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and I groups showed gains in Ma, G group 
small loss, and C group a large loss. The 
only significant change between diagnoses 
occurred on Welsh's (1956) Ё scale. А 
reduction in R scores (repression) between 
initial and terminal testing occurred in the 
neurotic and short-term psychotic group, 
whereas an increase occurred in the long- 
term psychotic group. 

Change scores on the K, Pd, and SIM 
scales yielded significant interactions. In all 
three significant interactions, the greatest 
share of the variance is attributable to the 
differential response of long-term psychotics 
in psychotherapy when compared with the 
controls. Та these three interactions, one ог 
all of the three long-term psychotic groups 
in psychotherapy demonstrated changes in 
the maladaptive scale direction while the 
remain relatively the same or 
adaptive direction. On the 
other hand, nonpsychotics and short-term 
psychotics in psychotherapy change in an 
adaptive scale direction while the same 
diagnostic groups in the control condition 


changed only slightly or not at all. 


controls 
change in the 


Ward Behavior 


Ward behavior is another area of adjust- 
ment often used as a treatment criterion. 
Because of its frequent use and because 
changes in ward behavior are often said 
to occur concomitantly with treatment, a 
Ward Behavior Scale pertinent to the ex- 
perimental setting was constructed. 


The scale was constructed in the following 
manner. Psychologists, social service personnel, 
nurses, and aides familiar with the experimental 
ward setting submitted behavior items which they 
believed were indicative of patients' adaptation to 
the ward. 

Seventy-six of the 120 submitted items were 
selected by the experimenters for the Behavior 
Rating Scale. A review of the literature concern- 
ing behavior ratings revealed that 35 of the 76 
selected items were so similar to Hospital Adjust- 
ment Scale (HAS) items (Ferguson, McReynolds, 
& Ballachey, 1953) that these 35 HAS items and 
their alternatives described below, were utilized as 
part of the Behavior Scale. 

The 76 items were then grouped, a priori, into 
three areas of adaptation. They were (a) inter- 
personal communication—30 items, (b) self-care— 
31 items, and (c) participation in work and 
recreational activities—15 items. i 

The ratings were keyed in the following man- 
ner: for each of the 76 items, the experimenters 


TABLE 8 
COMPARISON OF TREATMENTS AND DiacNosEs oN MMPI CHANGE Scores 
O——— 
Interaction: 
MMPI Scale Between Treatments Between Diagnoses | Diagnoses X Treatment 
са 
xt df) x2(2 df) xr(6 df) 
r 
i 3.53 10.87 
rd pa 0.08 4.25 
3 K 0.64 0.75 18.75** 
4. Hs 5.41 0.65 P 
5.D 0.09 1.75 8.00 
А ` 1.75 . 
a T i i 
. M 2.43 0259 pin 
9. Pa zr» d 8:50 
10. РІ ЖЕ, 2 4.75 
11. Sc Е 3.05 6.06 
12. Ма 10:01» 1.00 10.25 
13. Es 0.94 1.09 8.00 
14. 4 1.06 12.19** 1.56 
15. R 0.65 1:25 16.88** 
16. SIM ` 


э, Significant at .05 level. 
Significant at .01 level. 
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made an a priori judgment as to whether the 
behavior described was adaptive or maladaptive 
in the ward setting. Then, four raters (one nurse 
and three aides) rated every patient one week 
after his admission to the experimental ward and 
again during his final week on the ward. Each 
item had two alternatives, one adaptive, the other 
maladaptive, ie. “The patient usually will reply 
if you speak to him," or, "The patient usually does 
not reply if you speak to him." An item was 
scored in the following manner. If the four raters 
checked the adaptive alternative, the item was 
scored four; if three of the four raters checked 
the adaptive alternative, the item was scored three; 
if two raters checked the adaptive alternative, the 
Score was two; if one rater checked the adaptive 
alternative, the score was one; if no rater checked 
the adaptive alternative, ie, if all four checked 
the maladaptive, the score for the item was zero. 
А. patient's total score was the sum of all items. 
His score for the three adjustment areas was the 
sum of the items for each area. 


Comparison of the change score vari- 
ances for the 12 experimental groups is 
presented in Table 9. Bartlett’s (Edwards, 
1950) test yields a X? of 44.76 which, for 
11 df, is significant beyond the .001 level. 
The marginal totals in Table 9 reveal that 
the greatest absolute behavioral change oc- 
curred in the group living therapy situa- 
ton and, diagnostically, in the long-term 
psychotic groups. 

: А comparison of behavior change medians 
15 presented in Table 10. 

This table reveals that no significant differ- 
ences in ward behavior median change 
Scores obtain between treatments, diagnoses, 
or their interaction. 


Holland Vocational Preference 
Inventory (HVPI) 


It has been found by Holland (1958) 
and others that vocational preferences re- 


TABLE 9 


VARIANCES FOR THE 12 EXPERIMENTAL GROUPS ON 
Warp BEHAVIOR CHANGE 


GG G I С Меап 
М 1,157.00] 757.43| 319.86/1,108.14| 835.61 
Ps 888.57) 188.29] 828.00| 399.71| 576.14 
PI 7,713.42 4,101.00|1,601.42|1,105.00 3,630.21 


3,253.0011,682.24| 916.43| 870.95 


TABLE 10 | 


COMPARISON OF TREATMENTS AND DIAGNOSES | 
ох BEHAVIOR CHANGE 


Inter- 
action: — 
Between | Between | Treat — 
Ward Behavior Treat- | Diagnoses) ment d 
Scale ments D | 
x?(3 df) | x2(2 df) | xe(6 A 
Total Behavior | 
Rating .86 1.00 n 
Personal Care 4.19 0.08 ۴ 
Participation in 1.259] 
Activities 1.00 3.4 Y 
Interpersonal 6 
Communication .86 .58 5.5 Г 


flect personality characteristics. Changes | 
life adjustment could, theoretically; be je 
vealed by such an inventory. The 3 
selected to evaluate this area was the 
land Vocational Preference Inve” 
(1958). bau 
Holland's Inventory contains 300 ү с 
comprising 12 scales. These scales 
briefly described as follows: pt 


. а геј | 

Scale 1. Physical Activity. This scale engt 
sents a concern with motor skills, physical $ 

and a concrete approach to life problems. 65 


ur! 
Scale 2. Intellectuality, This scale mea nt 
preferences for conceptual methods of adj" and 
and strong needs for understanding the WO resent 
perhaps one’s self. High scores may гер 
healthy ego functioning ("ego strength")- to 


PEDE r5 
Scale 3. Responsibility, This scale aPP are 
be an “oral receptive” scale. High score 
dependent, verbal, and responsible. rê 


H H ea 
Scale 4. Conformity, This scale is m 5 
of extraception in the sense of conforming: 
oriented, and conventional behavior. 


Scale 5. Verbal Activity, This is 2 
aggressive” scale. High scorers are charas; an 
as aggressive, oral, concerned with leaders” 

power. the 


КУУ, 

Scale 6. Emotionality. This scale assess шге 

stability of the subject. High scorers are cha 5 af 
ized as emotional and expressive. Low sco" 

characterized as cool or “flattened.” the 


5 б. 
Scale 7. Control This scale meas үф 
habitual inhibition of impulses to act out trol 
High scorers are regarded аз “оуег-сой nde 
and low scorers as “impulsive an 
controlled.” 
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Scale 8. Aggressiveness. This scale is a measure 
of aggressiveness with high scores representing 
aggressiveness and low scores passivity. 


Scale 9. MF. This is an empirically constructed 
masculinity—femininity scale. High scores repre- 
sent identification with or choice of typically 
masculine occupational roles; low scores represent 
the choice of feminine occupational roles. 

Scale 10. Status. This scale represents the sub- 
ject's need for status or prestige. 

Scale 11. Infrequency. This scale is similar to 
the MMPI F scale. It is intended to provide an 
estimate of the degree to which the subject makes 
unusual or infrequent occupational choices. 

Scale 12. Acquiescence. This scale indicates the 
response set of the subject, or the degree to which 
he responds "like." 

Bartlett’s test (Edwards, 1950) reveals 
that significantly different variances occur 
on the following scales: Conformity, Oral 
Aggression, Emotionality, MF, and Acqui- 
escence.!? The mean variances for the 12 
experimental groups are presented in Table 
11. Table 12 shows significant differences 
between mean variances for treatments and 
interaction. Table 11 shows the greatest 
mean variance in the three psychotherapy 
groups when compared to the control group. 
Тыз table also reveals that the significant 
interaction is primarily due to an increase 
in variance of long-term psychotics in psy- 
chotherapy, whereas no such increase ap- 
pears in the controls. 


12 Difference scores were computed in two ways. 
On those scales where the highest score is con- 
Sidered most adaptive, the difference scores yom 
determined by substracting the initial from the 
terminal test score. On other scales, where the 
mean scale score is considered most adaptive, 
difference scores were determined by computing 
the unit movement toward or away from the mean. 


TABLE 11 


MEAN VARIANCES FOR THE 12 EXPERIMENTAL 
GROUPS ON CHANGE SCORES FOR THE 
12 HVPI Scares 


GG G I c Mean 
DE 
21.87 | 48.67 | 38.51 | 34.12 
р ED 43.12 | 47.50 | 13.94 | 32.63 
PL 35-49 | 82.22 [104.18 | 19.02 | 64.88 
Mean | 37:96 | 46.74 | 66.98 | 23.82 


TABLE 12 


CoMPARISON OF TREATMENTS AND DIAGNOSES ON 
12 HVPI ScALE VARIANCES 


Interaction: 
Between Between Treatment X 
Treatments Diagnoses Diagnoses 
xr (3 df) x? (2 df) x7 (6 df) 
30.25** 4.16 20.52* 


*.01 level of significance. 
**.001 level of significance. 


Comparisons of treatments and diagnoses 
on median change are presented in Table 13. 

Table 13 indicates that there are no sig- 
nificant differences between treatments or 
diagnoses. Two significant interactions oc- 
cur—one on the intellectuality scale and the 
other on the masculinity-femininity scale. 
These two significant interactions, however, 
do not show any consistent differences 
when the median change scores for each of 
the 12 experimental groups are investigated. 
Due to the number of tests of significance 


TABLE 13 


CoMPARISON OF TREATMENTS AND DIAGNOSES ON 
HVPI CHANGE Scores 


т 
Inter- 

Be- Be- | action: 

tween | tween | Treat- 

HVPI Scale ments | Diag- | ment X 
noses | noses 

xr (3 df) |х (2 df)| х;(6 df) 
1. Physical Activity 3.24.| 2.95. | 75 
2. Intellectuality 1.01 „15 | 15,13* 
3. Responsibility 2.71 2:18 |" 9238 
4. Conformity 5.26 „25 | 39.26 
5. Verbal Activity 3:35 | 1.62 | 10.25 
6. Emotionality 1.69 75 6.75 
7. Control 5.08 ‚25 | 3.31 
8. Aggressiveness 1.18 | 1.02 | 3.06 

9. Masculinity- 

Femininity 5.09 61 | 17.75** 
10. Status 5.85 | 1-01 | 519 
11. Infrequency 13 .33 | 2.06 
12. Acquiescence 2.65 | 1.09 | 7.63 


* Significant at the .05 level. 
** Significant at the .01 level. 
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computed, one should approach these find- 
ings with caution. 


Q Sort 


(2 sort is another instrument frequently 
utilized to evaluate changes that occur dur- 
ing psychotherapy. In this study, a 100-item 
sort with five scales of 20 items each was 
used. 

The general factor structure of Sheer's ® 
Sort was followed; however, items appro- 
priate to the experimental population were 
constructed. The five areas that the О sort 
attempted to cover were: (a) social adapta- 


id D. E. Sheer. A self-concept test: Reliability 
and validity data. Personal communication. 1955. 


bility, (b) emotional control, (c) conform- 
ity, (d) self-acceptance, and (e) self- 
confidence. These were sorted both for self 
and ideal on initial and terminal testing: 
Each of the five scales contained 10 negative 
and 10 positive statements, the directionality 
of a statement being determined by the 4 
priori judgment of the authors. For e 
ample, the self-confidence scale had as 4 | 
positive item, “I can argue for my point 9 
view" and, as a negative item, “I do as 1 
am told.” In scoring, positive items We" 


А Я ive! 
were sorted,'' and negative items rece 


reader 


ê Koê i я E 1 s, the Е 
For instructions and pile numbers, Instr" 


is referred to the section on Experimental 
ments and Testing Situation. 


| Я «oh the 
given the score of the stack into which th у 
| 
| 


TABLE 14 
CORRELATIONS BETWEEN SELF AND IDEAL О Sorts AT INITIAL AND TERMINAL TESTING 
GG G I c 
> а! 
Subject Initial Terminal | Initial Terminal | Initial Terminal | Initial Term! 
r r r ۴ r , r 7 | 
Nonpsychotic 31 | 
1 .67 .39 .45 .62 53 M .32 `03 
2 —.14 .39 .39 .42 0 — —.33 —.02 79 | 
3 —.35 —.12 —.51 —.43 .36 60 . 81 02 
4 .39 .61 .46 .42 .34 .49 .10 47 
5 = .17 —.20 78 .90 .14 .66 .64 “09 
? .52 .66 .28 .09 —.45 —.49 —.07 “58 
: .03 — —.02 .54 .43 8 =mi .19 "s | 
—48 -.31 —.25 24 | —.07 .15 .50 | 
Short-term Psychotic 9 | 
$ 34 .32 21 -16 50 .60 .09 = | 
я 41 .33 —.14 .50 —.08 —.03 .06 “68 
Р .66 .16 —.07 .09 .04 .05 .58 165 
E —.11 .37 —.14 .02 .58 .73 .65 “46 
Я .38 .68 .54 61 .57 .69 .50 “gh 
1 —.09 .33 —.04 21 —.18 77 .73 "él 
8 —.07 .00 11 —.02 .56 .63 250 “gh 
.46 Al 11 19 —.54 .65 .74 ` 
Long-term Psychotic $2 | 
1 32 — —.08 .27 19 | —.16 ла 78 pt 
1 B 37 —.08 72 ды —.33 .58 28 M 
i E 43 —.05 .38 25 —.23 .61 “09 
ы E- —.15 .53 .59 —.16 | —.48 —.52 Ts ] 
6 ea .26 „20 .34 —.72 — .66 .07 “93 
7 к. —.38 .65 .45 .58 48 .87 "61 
i 5 — .08 .60 .29 .28 .54 .53 02 
05 .10 59 .24 .58 .03 .30 x 
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TABLE 15 


VARIANCES FOR 12 EXPERIMENTAL GROUPS ON 
z SCORE CHANGE 


GG | G I С | Mean 
N .069 | .074 | .151 | .064 | .090 
Ps “969 | .119 | .286 | .036 | .128 
Pl “169 | .229 | .094 | .117 | .132 
Mean .102 | .141 | .177 | .072 


the score of nine minus the pile number. 
The total score for each factor was the 
arithmetic sum of the 20 items in the scale. 
Thus, the following sets of measurements 
were obtained from the Q sort: (a) corre- 
lations between the 100-item self and ideal 
sorts at initial and terminal testing; (5) 
scores on self-sort on the five aforemen- 
tioned scales at initial and terminal testing ; 

‚ (c) score on ideal-sort on the five scales at 
initial and terminal testing. 

The usual comparisons made on Q sort 
data involve correlations between self- and 
ideal-sort at initial testing and between the 
same measures at terminal testing. Theo- 
retically, successful therapy is evidenced by 
a positive increase in the correlations. з 

The correlations between self- and ideal- 
sorts at initial and terminal testing are pre- 
sented in Table 14. These correlations were 
then transformed into 5 scores, and a 
population of differences between £ scores 
(initial-terminal) was obtained. 

The variances of this distribution for the 
12 experimental groups is presented _ in 
Table 15. Barlett’s test (Edwards, 1950) 
yields a x? of 13.42 which, for 11 df, does 


TABLE 16 


COMPARISON OF TREATMENTS AND DIAGNOSES ON 
z SCORE CHANGE 


Interaction: 
mem, | ages | Diagnos 
xG df) x2(2 df) x7 (6 df) 

г 2.85 4.00 4.50 


ТАВІЕ 17 


MEAN VARIANCES FOR THE 12 EXPERIMENTAL 
Groups ON CHANGE SCORES FOR THE 
5 SELF-SORT SCALES 


GG G I c Mean 
N 113.11| 124.80] 140.26| 151.20] 132.34 
Ps 150.60) 340.06| 840.63) 72.77 351.02 
Pl 311.43] 526.95] 145.23] 126.09) 277.43 
Mean 191.71] 330.60] 375.37) 116.69 


not reach the .05 level of significance; how- 
ever, the trend of larger variances in the 
three psychotherapy treatments and long- 
term psychotic diagnostic group occurs 
again. 

Table 16 presents a comparison of treat- 
ments, diagnoses, and their interaction on 
the median differences between = scores. 
This table reveals that no significant differ- 
ences obtain. 

The previous comparisons dealt with self- 
ideal correlations based upon the 100 items. 
The following analyses are for the five Q 
sort scales of 20 items each. The first 
analysis compares change scores on self- 
sort alone. А Bartlett's test (Edwards, 
1950) comparison of the self-sort change 
score variances showed significant differ- 
ences between the variances on the social 
adaptability, self-acceptance, self-confidence, 
and emotional control scales. 

The average variances for the five scales 
are presented in Table 17. A nonparametric 
analysis of the variances for the five scales 
is presented in Table 18 which shows that 


TABLE 18 


CoMPARISON OF TREATMENTS AND DIAGNOSES ON 
5 SELF-SORT SCALE VARIANCES 


Interaction: 
Between Between Treatments Ж 
Treatments Diagnoses Diagnoses 
xr (3 df) x?(2 df) xr (6 df) 
4.28 5.20 22.80* 


* Significant at the .001 level. 
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TABLE 19 


CoMPARISON OF TREATMENTS AND DIAGNOSES ON 
SELF-SORT CHANGE SCORES 


TABLE 20 


MEAN VARIANCES FOR THE 12 EXPERIMENTAL | 


Groups ON CHANGE SCORES FOR THE 
5 IpEAL-SonT SCALES 


Inter- 
Be- Be- | action: GG G I e Mean 
tween | tween | Treat- Е. 
Self-Sort Scale Treat- | Diag- | ment X y 86.941 47.37| 138.42] 140.77 103: 
ments | noses | Diag- — p, 95.80] 85.67 237.24] 49.16 116-53 
позаг. р] 200.91| 174.72| 162.95| 95.55| 158: 
2 Меап 127.88] 102.59] 179.54| 95.16 
xr (3 df) x*(2 df) 
1. Emotional Control 4.85 .58 . : T andi 
2. Conformity -71 | 5.25 As was previously mentioned, initial Ww. 
A A A Wes terminal ideal-sorts were also obta! 
5. Self-Confidence i 3.29 E These were scored on the five scales. Ж 
A comparison of the ideal-sort | 


* Significant at the .05 level. 
** Significant at the .01 level. 


the interaction alone is significant. Table 17 
suggests that the interaction is significant 
because nonpsychotics show relatively equal 
variances over the four treatments, short- 
term psychotics show larger variances in 
the three groups receiving psychotherapy 
with the largest in the individual therapy 
group, and long-term psychotics show the 
largest variances in the group living and 
group therapy treatments. 

Table 19 presents a comparison of the 
medians for the self-sort change scores. 
No significant differences obtain on any of 
the five scales between treatments or diag- 
noses. However, three significant inter- 
actions obtain. 

When the three significant. interactions 
are viewed collectively, it is evident that the 
major contributor to the significant inter- 
action is the differential response of the 
diagnostic groups to all three psychotherapy 
treatments compared with the control. The 
following schema generally holds for the 
three scales: 


АП Therapies Control 
Nonpsychotics Large Positive No Change or 
Change Small Positive 
Change 
Short-term Large Positive Small Positive 
Psychotics Change or 
Small Nega- 
tive Change 
Long-term Moderate Nega- Large Positive 
Psychotics ative Change Change 


score variances for the 12 experimen" 
groups revealed significantly different 
ances on four of the five scales: ето 
control, conformity, self-acceptance, 
self-confidence. «ances 

The average of the five scale var 
for the 12 experimental groups is ао 
Table 20. An analysis of the five vari 21. 
when treated as scores appears in Table he 
Only significant differences between ig, 
variances of the treatment groups 0 Qin 
Table 20 indicates that the variances gj. 
descending order of magnitude are: apy’ 
vidual therapy, group living, group the 
and control. core? 

An analysis of the median change 5 Gig" 
on ideal-sort is presented in Table 22: ре 
nificantly different changes оссигге rion? 
tween diagnostic groups on the emo ИН 
control, conformity, and social adapt? as 
scales. The short-term psychotic group ce 
the largest median improvement i? 


tion? 


TABLE 21 
ох 


Е5 
COMPARISON OF TREATMENTS AND D1AGNOS 
THE 5 IpEAL-SonT SCALE VARIANCES 


Interaction; 
Between Between Treatme os 
Treatments Diagnoses Diagn 
x2 df) x2(2 df) x 6d 
0 
9.66* 2.80 10.4 


* Significant at the .05 level. 


уап 
1 


TABLE 22 


| CoMPARISON OF TREATMENTS AND DIAGNOSES ON 
| IpEAL-SoRT CHANGE SCORES 


Inter- 
| Be- Be- | action: 
| tween | tween | Treat- 
| Ideal-Sort Scale Treat- | Diag- | ment X 
ments | noses | Diag- 
noses 

x2(3 df) |x? Q df) x?(6 df) 
1. Emotional Control 2.60 | 6.59*| 3.94 
2. Conformity 7.00 7.60*| 5.50 
3. Self-Acceptance 2.88 1.58 | 5.25 
4. Social Adaptability 2.16 6.78*| 5.06 
5. Self-Confidence 1.24 | 1.59 | 8.75 


| *.05 level of significance. 


tional control and conformity, whereas both 
the non- and short-term psychotic groups 
demonstrate greater median improvement 
in the area of social adaptability than do 
the long-term psychotics. 


Thematic Apperception Test 


4 Eight selected TAT cards were presented 
to the subjects at the initial and terminal 
testing sessions. The cards were selected 
by the authors for their value in evoking 
responses concerning interpersonal relation- 

In order of presentation they were: 

‚9, 13 MF, 7 BM, 6 BM, and 12 M. 
Protocols were scored in the following 
manner: All identifying information was 
first removed. Then, the initial and final 
protocols for each patient were presented 
to the experimenter who made a judgment 
as to whether one protocol was more adap- 
tive or whether they were adaptively equiv- 
alent. The scoring system for the protocols 


was: А 
т. Protocol А ї 
Protocol В w 
more rejected 


none. | 
II. Protocol А is judged more adaptive than 


: i developed 
j 1 B when A's stories reveal 
КОЕТ, B’s are descriptions of the cards. 
is j ive than 
] A is judged more adaptive 
ч. A “hoth reveal plots but A's are more 


well-developed than B's. 


s judged more adaptive than 
hen Protocol B has four or 
cards and Protocol A has 


i eo 
AE 
NOD 
v 
EN 
№ 
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IV. Protocol A is judged more adaptive than B 
when both reveal well-developed plots but 
A's stories contain direct affective state- 
ments, ie, love, hate, etc., while B's do not. 


V. Protocol A is judged more adaptive than 
B when both reveal well-developed plots, 
both contain direct affective statements, but 
Ргоюсо! А has more positive statements 
EE interpersonal relationships than 


Protocol A is judged more adaptive than B 
when both reveal well-developed plots, both 
contain direct affective statements, both 
have an equivalent number of positive state- 
ments concerning interpersonal relationships, 
but A's outcome is more positive than B's 
outcome. 

А After the judgments were completed, 
identifying information was revealed and 
scores were recorded. The scores were re- 
corded by comparing final with initial 
protocols as follows: final more adaptive 
than initial adaptively equivalent, or less 
adaptive than initial. 

Two experimenters independently com- 
pared the initial and final protocols of the 
same 20 patients by this scoring system. 
They agreed in 18 of the 20 (or 9096) of 
the cases. 

For statistical analysis the scores were 
dichotomized into protocols showing posi- 


VI 


TABLE 23 


TAT CHANGES FOR THE 12 EXPERIMENTAL 
CONDITIONS BY % 


Total 
GGG | С 


Positive change А j 
N | No change and 25 .0|50.0|87.5 
negative change 


Positive change г 4 5 5 
No change and 62 .5|25.0/62.5|100. 


negative change 


Ps 


50.0 


Positive change 25. 
7 0] 50.0 


Рі | No change and 
negative change 


Total % 
Positive change 
No change and 
negative change| 
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tive change and those showing no change 
or negative change. Table 23 presents the 
percentage of cases in the two categories 
for each of the 12 cells. 

Sheffield’s (1957) method was used to 
compare the treatment and diagnostic groups. 
Table 24 presents the апа1уѕ15.'° 

Table 24 indicates that differences be- 
tween treatments and diagnoses do not 
reach the .05 level of significance. The in- 
teraction is significant at the .01 level. 
Reference to Table 23, however, suggests 
that this interaction is significant because 
more positive TAT change is brought about 
by group methods and control than with 
individual therapy when nonpsychotics are 
treated; more positive TAT change occurs 
with the three psychotherapy groups than 
controls when short-term psychotics are 
treated; and more positive TAT change is 
brought about by individual therapy when 
long-term psychotics are treated. 


Follow-up 


A follow-up questionnaire was obtained 
for each patient six months after he had 
left the experimental program. The ques- 
tionnaire was completed by an individual 
with whom the patient was living, or, if he 
had not left or had returned to the hospital, 
by a psychologist familiar with the case.!^ 

Follow-up Scale. The questionnaire was con- 
structed to encompass several areas of life adjust- 
ment. Simplicity of items was stressed. The items 
were multiple choice, with a designated space for 
remarks if the respondent believed the structured 
choices to be inadequate. The questionnaire con- 
sisted of five objective and four subjective items. 
The five objective items concerned: (a) per- 
centage of. time employed, (b) hospitalization, 
(c) alcoholic indulgence, (d) antisocial behavior, 


15 'The within-cell variance is 1.98, 

16 [nformation was obtained on all 96 patients 
for Items 1 and 2. On Items 3 through 9, informa- 
tion was obtained on 94 of the 96 subjects. The 
scores of the two missing patients was the modal 
score for his experimental group; however, deletion 
of these two patients would not alter the results. 
Eleven of the 96 patients were failures, i.e., did not 
leave the hospital. Viewed diagnostically, nine 
were in the long-term psychotic group and two 
in the nonpsychotic group. Viewed by treatments, 
four were in group living (GG), two in group 
therapy (G), and five in individual therapy (I). 


TABLE 24 


COMPARISON OF TREATMENTS AND DIAGNOSES 
on TAT CHANGE 


Ѕачагеѕ df MS Е 
Treatments 1.67 3 1.56 ae 
Diagnoses 2.67 2 1.34 | 0. 
Interaction: 
Treatment Х = 
Diagnoses 35.33 6 6.06 Me 
Total 42.67 11 3.88 |1. 
*.05 level of significance. 
**.01 level of significance. 
Я jective 
and (с) number of friends. The four subject 


n 
items involved a judgment by the respondent ks 
cerning patient's (а) amount of verbal comm 
cation with others, (b) general adjustment, | of 
degree of problem behavior, and (d) deste 
illness. ^ into 

For analysis, the nine items were divided i 
the following dichotomies : 

1. Was, or was not, rehospitalized 

2. Was, or was not, employed full-time 

3. Has some, or no, close friends 

4. Frequently, or rarely, talks to the peop! 

whom he lives 


e with 


5. Getting along satisfactorily, or poorly 

6. Usually hard, or easy, to get along W! 5 

7. Well, or sick, most of the tim? 

8. Is, or is not, a problem drinker 1 Jaw 

9. Has, or has not, been in trouble with t rat 
Each of the nine items, therefore, has an arbi ета” 
adaptive and maladaptive alternative. The item? 
choric correlations among the nine follow-UP 
are presented in Table 25. itive 

Table 25 reveals that, in general, ро? The 


relationships exist among the items ; 
high degree of relationship among Item d 
б, and 7 indicates a common paramete! itu 
the theoretical possibility of item su hip? 
tion. However, the remaining relations jfi 
while generally positive and often sig {0 
cant, are rarely of sufficient magnitt е ої 
warrant the substitution of one ite?" 
another. у 
Comparisons of the follow-up дет, 
treatments, diagnoses, and their ings?) 
tions, computed by  Sheffield's 
method, are presented in Table 26. of 
Table 26 indicates that the amou? sig 
employment item is the only one MT 5. 
nificantly differentiates the treatment 87 сай 
The percentage of patients from pt fol 
ment group employed full-time 15 9 
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TABLE 25 


CORRELATIONS AMONG 


THE 9 FOLLOW-UP ITEMS 


1 2 3 4 5 6 7 8 9 

1. Rehospitalization 
2. Full-time employment .66** 
3. Number of friendships 12 .46* 
4. Amount of interpersonal 

communication „85 45 .78** 
5. General adjustment 56* 40 .54* 55* 
6. Degree of problem 

behavior .42 42 .78**| .36 .90** 
7. Degree of illness .59**| .57* | .50* 36 .93=*| .91** 
8. Type of alcoholic 

indulgence „29 .14 |—.08 ali .77**| .59* .60* 
9. Legal violations .65**| .25 |-.22 |—.22 .71* .62* .44 NAILS 


* Significant at .01 level. 
** Significant at .001 level. 


lows: Group living, 42%; Group therapy, 
33%; Individual therapy, 4296; and Con- 
trol, 8%. lt can readily be seen that the 
three treatments involving psychotherapy 
have a higher percentage of employed than 
the control and that the group living and 
individual therapy treatments have the high- 
est percentage of the three treatments. 
Table 1 indicates that this result cannot be 
explained by differences in employment 
prior to the study. 


Diagnostically, six of the nine items sig- 
nificantly differentiate the groups. Table 27 
presents the percentage of the three diag- 
nostic groups in the adaptive alternative for 
each item. This table indicates that in the 
areas of hospitalization, employment, and 
friendships, the short-term psychotic group 
displayed the most adequate adjustment, the 
nonpsychotics next, and the long-term 
psychotics the poorest adjustment. In the 
two areas involving judgments concerning 


TABLE 26 


COMPARISON OF 


ExPERIMENTAL GROUPS ON FOLLOW-UP ITEMS 


Follow-Up Area 


Between 
Treatments 


F ratio (3 df) 


Interaction: 
Treatment X 
Diagnoses 


Between 
Diagnoses 


F ratio (2 df) F ratio (6 df) 


Legal violations 


1. Rehospitalization 0.16 8.165* 1.02 
2. Amount of employment 2.78* 6.25** 0.50 
3. Number of friendships - 1.26 4.81** 1.60 
4. Amount of interpersonal communication 0.56 2.18 1.07 
5. General adjustment . 0.55 3.05* 0.44 
6. Degree of problem behavior 2.08 0.89 0.78 
qo; TIME ME 
„шее! alcoholic indulgence 0.15 2.12 0.92 


Note.—The within-cell variance is 1.98. 


* Significant at the .05 level. 
** Significant at the .01 level 
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TABLE 27 


% or DIAGNOSTIC GROUPS SHOWING BEST 
ADAPTATION ON THE 6 SIGNIFICANT 
FOLLOW-UP ITEMS 


Short- | Long- 

Non- | Term | Term 

Follow-up Area Psy- | Psy- Psy- 
chotic | chotic | chotic 

1. Out of hospital 50.0 87.5 40.6 
2. Employed full-time 34.4 | 50.0 9.4 
3. Some friends 62.5 75.0 37.5 
5. Getting along satis- 53.1 | 81.3 | 59.4 

factorily 

7. Well most of the time 53.1 84.4 | 59.4 
8. Nota problem drinker | 65.6 | 90.6 | 90.6 


general adjustment (Items 5 and 7), again 
the short-term psychotics were rated as 
making a significantly better adjustment 
than either the nonpsychotics or long-term 
PSychotics. Problem drinkers are more 
prevalent in the nonpsychotic group. 


Relationships among Selected 
Psychotherapeutic Criteria 


Thirteen scales chosen because they are 
frequently used as psychotherapeutic criteria 
and because they measure adjustment in 
different areas were selected for correla- 
tional analysis. These scales were: follow- 
up measures of Rehospitalization, Employ- 
ment, and Degree of Illness; Sc, Es, SIM, 
A, and R scales of the MMPI; Q sort; 
TAT; Ward Behavior; Job Behavior;'* 
and Finney's (1954) Group Therapy Scale. 

Only the last two scales will be described 
here since the others were discussed in pre- 
ceding sections. The Job Behavior Scale 
included 31 items dealing 


performance and interper 
in his hospital job assignm 


with the patient's 
sonal adjustment 


н ent. These iteris, 
like those in the Ward Behavior Scale, 
were stated in terms of adaptive and mal 


adaptive alternatives: ie, “The patient 
usually profits by his mistakes" or “The 
patient repeats his mistakes over and over 


17 Тһе Job Behavior Scale h 
with the American Documentat 
Footnote 8. 


las been deposited 
lon Institute. See 


again.” The rating was completed by M 
patient's job supervisor in the first an 
final week of the experimental program. A 
reliability coefficient of .93 was found for 
20 patients, by the method of successive 
ratings. Each patient in group therapy was 
rated on the Group Therapy Scale every 
30 days. : 

All scales were scored so that the most 
adaptive scores were numerically highest 
and the least adaptive were lowest. Thus, 
only positive correlations indicate an associ- 
ation between the theoretical adaptive levels 
of the scales. For all within-hospital meas- 
ures, final scores completed shortly ep. 
discharge were correlated. In the case G 
the Group Therapy Rating Scale, the laet 
rating was used. For the Q sort, the e 
correlations between self- and ideal-sor 
were ranked, with the highest positive cor- 
relation receiving the highest rank. The 
final 96 TATs were divided into two groups 
—most and least adaptive.'* 

The М for all scales was 96 except for 
Job Behavior with an N of 70 and the 
Therapy Rating Scale which had an N of 
41. All scales, except the follow-up items, 
were divided at the median for computa 
tional purposes. 

The correlations are presented in Table 
28. An examination of the matrix reveals 
two rather independent clusters of items: 
One involves the three follow-up items an 
the group therapy behavior rating. The 
other cluster includes the $6, A, Es, and 
SIM scales of the MMPI, the Q sort, and 
the TAT asa marginal member, 

Three of the scales are not contained i? 
these two clusters, They are Job Perform- 
ance, Ward Behavior, and Welsh’s (1956) 
R. Hospital job behavior is not significantly 
related to any other measure and Welsh’s 
R is correlated only with ward behavior: 
On the other hand, ward behavior is Si 
Hifcantly related only to the А scale, grouP 
therapy behavior, and TAT. 

That test scores are not significantly I 
lated to follow-up and are associated wit 


18 Genera] Principles of scoring outlined !! 
the TAT section were followed. 


$ 
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each other is not surprising (Fairweather, 
Moran, & Morton, 1956). But why is group 
therapy behavior the only within-hospital 
measure significantly related to follow-up? 
The group therapy rating completed by the 
therapist after the patient’s last session was 
used for the correlational analysis. Thus, 
this rating involves continuing observations 
of an individual over several weeks’ time. 
Further, in contrast to ward behavior and 
job performance ratings, the group therapy 
ratings were made by psychologists rather 
than by nurses, aides, and job supervisors. 
In addition, the group therapy situation 
involves intense interpersonal experiences 
which may be a better approximation of the 
extramural situation than either ward be- 
havior or the hospital job situation. There- 
fore, both the rater and the situation may 
have contributed to these significant rela- 


tionships. 


DiscussioN 


The purpose of this study was to test a 
series of hypotheses in an attempt to de- 
termine the relative efficiency of four com- 
monly utilized therapeutic procedures with 
three different patient groups. In view of 


the lack of agreement concerning the valid- 
ity of a specific criterion for evaluating 
treatment, several criteria were selected. 

In neuropsychiatric hospitals, economic 
considerations are important in determining 
the feasibility of therapeutic endeavors be- 
cause most hospitals operate on limited and 
frequently minimal budgets. Therefore, the 
amount of time taken to complete a speci- 
fied program is a relevant factor in the 
selection of treatment programs. The 
analysis of time in treatment yielded sig- 
nificant differences, both for treatment and 
diagnostic groups. Individual therapy pa- 
tients remained in treatment longer than the 
other three groups. Group living took less 
time, group therapy and control the least, 
and approximately equivalent, time. By 
diagnostic group, the long-term psychotic 
remained in treatment significantly longer 
than did the short-term and nonpsychotic 
groups. In terms of time alone, it appears 
that group therapy and individual work 
assignment are the most economical. Diag- 
nostically, nonpsychotics or short-term psy- 
chotics can be treated most economically 
(shortest time in hospital). 

While time in treatment is an important 
consideration in the selection of psycho- 


TABLE 28 


1C CORRELATIONS AMONG THERAPEUTIC CRITERIA 


5 6 7 Se], 9 [20 1 Peas 


'TETRACHOR 
1 2 3 4 
1. Follow-up Re- 
hospitalization 
2. Follow-up Employ- 
ment .66* 
3. Follow-up Degree 
of Illness .59ъ| .57» 
4. Finney's Group 
Therapy Behavior |.42“| .75* .09 
5, Ward Behavior .04 .28 |—.32 
6. Job Behavior ,02 |—.06 .08 |-. 
ў: { ME Welsh’s 4e -06 05 07 
8. MMPI Es -30 ‚1% |= 01 
9. MMPI Welsh's 4 .3n 18 | 116 
0: № ME Se 31 .03 29 
"n MMPI SIM 36 | .09 |—.09 
12. Q Bort 21 10 =4 
13. TAT 04 | а= 


2586 


36 [1 


Mi 
p 
Auson 
Со о 
v 


gui 


478 


в.05 level of significance. 
b.01 level of significance. 
0.001 level of significance. 
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therapeutic programs, it sheds little light 
upon the success or failure of such pro- 
grams. In an attempt to ascertain the re- 
lationship between time in treatment and 
successful follow-up, a “global follow-up” 
score"? was computed for each patient. A 
comparison of treatments, diagnoses, and 
their interaction on this criterion is pre- 
sented in Table 29. The only significant 
difference is between diagnoses; hence, com- 
parisons of success and time in treatment 
were independently computed for the three 
diagnostic groups. Table 30 shows no sig- 
nificant differences in mean time in treat- 
ment for the least and most adaptive 
follow-up groups." Thus, length of time 
in treatment appears to have little relation- 
ship to successful posthospital adjustment. 
This result is reasonable if we consider 
psychotherapy as a learning process. Such 
а process is unique to each individual and 
each learns at his own rate. If such is the 
case, the amount learned may be unrelated 
to time in the learning situation. Perhaps 
future studies might profitably attempt to 


discriminate slow and fast learners prior 
to treatment. 


‚ All nine follow-up items were dichotomized 
into adaptive and maladaptive alternatives. The 
adaptive alternative received a score of one and 
the maladaptive a score of zero. Each patient's 
Score was the sum of the nine items. 

20 Each diagnostic group was divided at its 
median global follow-up score. Least and most 


adaptive refer to patients below and above this 
median. 


TABLE 29 


CoMPARISON OF TREATMENTS AND DIAGNOSES ON 
GLOBAL FOLLOW-UP SCORES 


Interaction: 


Between Between Treatment X 
Treatments Diagnoses Diagnoses 
х (3 df) x? (2 df) xr (6 df) 
2:15 21.75* 4.38 


* Significant at .001 level. 


Change was evaluated in the preceding 
sections by analyzing the difference scores 
between initial and final testing on the 
MMPI, behavior ratings, HVPI, TAT, and 
Q sort. The first finding was that signifi- 
cant differences in variances obtain between 
treatment and diagnostic groups. When 
change scores are viewed globally, the gener!c 
conclusion, that the three treatments И” 
volving psychotherapy demonstrate signifi- 
cantly more absolute change than the 
control, seems warranted. These differences 
do not appear to be simply a function ° 
time between tests. Table 9 shows that the 
individual treatment group presents ар” 
proximately the same variance as the CO! 
trols despite the fact that these two group* 
spent the greatest and least time in treat- 
ment, respectively. Further, absolute change 
seems to be related to type of therapy И" 
volved. For example, the greatest variance 
in ward behavior is achieved by the grouP 


TABLE 30 
COMPARISON ОЕ MEAN Days IN TREATMENT FOR PATIENTS IN LEAST AND Most ADAPTIVE 
FOLLOW-UP CATEGORIES 
Diagnoses 
N Ps PI 
Most Least Most Least Most Least 
Adaptive Adaptive | Adaptive Adaptive | Adaptive Adaptive 

Messi 117.57 81.25 | 89.35 90.58 | 160.56 147.71 
SD 65.35 45.35 | 71.18 75.35 | 74.2 57.55 
t 1.79 . 04 .54 


EE 
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living subjects, whereas the greatest Q sort 
variance is achieved by the individual 
therapy group. Group living therapy was 
oriented toward interpersonal behavior in a 
variety of situations, while individual ther- 
apy was more closely oriented toward per- 
ceptual change. Thus, the different vari- 
ances of the treatment groups in different 
areas of change might be, in some measure, 
an indication of the attendant responses 1n 
the learning situation. : 

While significantly larger variances were 
found for treatment groups аз compared 
to controls, significantly different variances 
also occurred between diagnostic groups. 
Generally, significantly larger variances ob- 
tained for the long-term psychotics when 
compared with the short-term and non- 
psychotic groups. Again, it seems highly 
unlikely that the markedly different vari- 
ances are attributable to time alone, although 
clearly they are confounded with time. 
From observation of the patients in the 
treatment processes, it seems likely that the 
significantly greater change score variances 
of the long-term psychotic group can be 
attributed, in part, to the greater range of 
responses available to them. In this regard, 
the repertoire of responses available to the 
long-term psychotic in reacting to a close 
interpersonal relationship ranges from, to 
take one example, complete withdrawal to 
hyperactive participation. ‚Ву comparison, 
the range of responses available to perge 
psychotic is much more restricted and, on 
this theoretical continuum, the short-term 
psychotics would fall somewhere between 
these two groups. In addition, long term 
psychotics in the three psychotherapy 
groups contribute greatly to the total long- 
term psychotic group variance, while the 
controls contribute little, Thus, as far as 
the scales are concerned, the long-term psy- 
chotics, and to a lesser but usually signifi- 
extent, the short-term psychotics, 
in either a positive or negative 
ychotherapy while the 
$ remain compara- 


cant 
change € 
scale direction 1n PS 
e diagnostic group 
imis da rame without psychotkerapy. i 
There are, of course, several possible 
explanations for the. different variances. 
One is that patients т therapy get better 


or worse but generally do not remain the 
same, whereas patients not in therapy re- 
main about the same. А second is inherent 
in the construction of the scales themselves. 
Most clinical instruments are constructed 
on the assumption of an underlying theo- 
retical dimension of adjustment. Thus, posi- 
tive change on a scale usually refers to an 
increment of movement along the scale in 
the adaptive direction. In this context, sub- 
jects show improvement only by moving 
in the same direction. Clinically, however, 
this assumption may not be tenable. For 
example, two patients bearing the same di- 
agnostic label, one a rigid conformist and 
the other a nonconformist, might be ex- 
pected, if therapy was successful, to move 
in opposite directions. One might move 20 
positive scale units and the other 20 nega- 
tive units on a Conformity Scale. The 
mean of the two changes is zero. Two un- 
treated patients might move two positive and 
two negative units. Here again, the mean 
is zero, and the difference between means 
is obviously not significant. The variances, 
however, of the two groups would reveal 
the change. That movement in either direc- 
tion on a scale may be an adaptive change 
for the individual patient is emphasized in 


TABLE 31 


COMPARISON OF SELECTED CHANGE SCORES FOR 
PATIENTS IN LEAST AND Most ADAPTIVE 
FOLLOW-UP CATEGORIES 


Diagnoses 
N Ps Pl 
Scale e 

x? х? x 
Job Behavior .05 .39 .88 
Ward Behavior 13 .00 .00 
MMPI F .98 .68 .00 
MMPI K .00 .01 2.17 
MMPI Sc .00 .00 .45 
MMPI Es 6.22* 1.20 1.14 
MMPI A .00 .01 .04 
MMPI SIM 13 .68 .13 
HVPI 2 .98 .01 .45 
HVPI 3 .02 .01 .00 
Q Sort 1.14 .15 00 


* Significant at .05 level. 
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two additional analyses. Table 31 shows 
that only one significantly greater positive 
change occurs for the least and most adap- 
tive follow-up groups, and this change was 
in the opposite direction predicted, i.e., least 
adaptive on follow-up showed greater posi- 
tive scale change. On the other hand, Table 
32 reveals significantly different variances 
for the least and most adaptive follow-up 
groups on five scales for the short-term 
psychotics and three for the long-term psy- 
chotics. Furthermore, on each significantly 
different scale, the larger variance was as- 
sociated with the most adaptive follow-up. 
Thus, it seems that change in either scale 
direction can be adaptive for the individual 
patient. For this reason, it might be clini- 
cally more meaningful if a directional pre- 
diction of adaptive change were made for 
each patient prior to therapy. Thus, positive 
change for a patient would be determined 
by the direction in which clinicians agree 
that the patient should move rather than 
by the scale itself. 

Although generally the variances of the 
change scores significantly differentiated the 


treatment groups, differences between 
change score medians did not. Of the 
within-hospital measures, only the Ma scale 
of the MMPI showed a significant differ- 
ence and, in view of the large number of 
measures, it seems plausible that little sig- 
nificance should be attached to this finding. 
Interactions, on the other hand, revealed 
significance on nine scales. On six of the 
nine significant interactions, the three psy- 
chotherapy groups showed change in an 
adaptive direction for the short-term and 
nonpsychotic groups, while an adaptive dec- 
rement occurred with the long-term 
psychotics. Generally, nonpsychotics an 
short-term psychotics in the control group 
remained relatively unchanged, while long- 
term psychotics showed a small or large 
adaptive increment. As far as scale score 
adaptiveness is concerned, then, попрѕу” 
chotics and short-term psychotics change 
more adaptively with psychotherapy while 
long-term psychotics change more adaptivelY 
without therapy. и 
On within-hospital measures, the diag- 
nostic groups showed significantly differe? 


TABLE 32 


COMPARISON or SELECTED CHANGE SCORE VARIAN 


CES FOR PATIENTS IN LEAST AND MOST 


ADAPTIVE FOLLOW-UP CATEGORIES 


PERCHE 


Diagnoses 
N Ps РІ 
Most | Least Most | Least M 
t À t ost Least 
Adaptive|Adaptive Adaptive Adaptive! Adaptive Adaptive 
Scale $ 5% Е 52 ы Е 5% s? F 
Job Behavior 9.80 | 31.81 | 3.25 | 20.25 | 11.00 e 
r : Т 5 1.84 2. 
Ward Behavior 388.25 | 934.02 2.41 735.30 | 186.22 3.95* ee тв. " 05 
MMPI F 85.63 | 45.14 | 1.90 | 197.41 | 30:91 6.39*| 340.17 | 331.23 | 1.03 
MMPI K 32.12 52.69 1.64 74.29 52.06 1.43 254.10 52.39 4.85* 
MME [3 rt. ps 2.12 397.41 | 369.74 1.28 305.58 635.84 2.08 
: 40 | 1.41 | 89.54 | 32.35 | 2.73 | igo” :03 | 3.8 
MMPI 4 16:9» | 53.92) 1.23 | 174.90 | 16,72 | ба 167.83 64.23 2.61 
Mr. 2 ed 34.56 2.96 59.15 26.92 2.20 80.11 | 101.35 1.27 
HVPI 3 29.39 | 37.57 1.28 110.03 26.39 4.17*| 115.06 86.29 1.33 
MMPI SIM 262.87 | 324.25 | 1.24 | 510.08 | 195.75 | 3°61 lios 549.54 | 2.08 
Û sort 09)  :06| 1.50 6| —.04| 4:004 00 20 | 2.22 


* Significant at .05 level. 
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change on four scales. Although such find- 
ings must be interpreted cautiously, they 
suggest, as set forth in the second hypothesis, 
that the short-term and nonpsychotic groups 
improve more with treatment than long- 
term psychotics. : 
Follow-up items showed significant dif- 
ferences between treatment groups on the 
employment item. АП three treatment 
groups showed a greater percentage of full- 
time employed than the controls, which sup- 
ports the first hypothesis. The group living 
and individual therapy groups have the 
highest percentages of employed patients. 
The eight other items did not significantly 
differentiate the treatment groups. If em- 
ployment were chosen as the criterion of 
posthospital adjustment, then either the 
group living or group therapy treatments 
are desirable because they take less time 
and personnel than individual therapy and 
are significantly more successful than the 
control. Further, such therapy would be 
more efficient with nonpsychotics and short- 
term psychotics than with long-term psy- 
chotics. On the other hand, if any other 
follow-up item besides employment were 
chosen as the criterion, e.g. rehospitaliza- 
tion, then hospital work alone would be the 
most efficient treatment. In this regard, it 
should be noted that follow-up employment 
is the only item yielding any measure Е 
the patient's contribution to others. It wi 
be recalled that nine significant interactions 
occur on the within-hospital measurements, 
but no significant interaction 15 found on 


follow-up. Thus, differential test score 
change of the diagnostic groups to treat- 
ment is not revealed by the follow-up. 

Diagnostically, six of the nine items sig- 
nificantly differentiated the groups; the 
short-term psychotics demonstrate the most 
adequate global posthospital adjustment, 
nonpsychotics are next, and long-term 
psychotics have the poorest adjustment. 
These findings again support the second 
hypothesis. The significant differences on 
follow-up ratings indicate all too clearly the 
problem facing treatment planners. The 
long-term psychotic, in this study, is much 
less responsive to psychotherapy than the 
nonpsychotic or short-term psychotic. 

In addition, Table 33 indicates that pa- 
tients who did not receive tranquilizing 
medication did as well on follow-up as 
patients who did. Further, the table reveals 
that the results were the same for all three 
diagnostic groups. Since tranquilizing 
medication is frequently used to improve 
ward behavior, ie. reduce management 
problems, and since ward behavior is un- 
related to follow-up, it is not surprising 
that the patients who did not receive drugs 
did as well on follow-up as those who did. 

It is interesting to note that nine of the 
eleven failures in the total program were 
long-term psychotics in psychotherapy. No 
failures occurred in the control group. On 
the other hand, the three full-time employed 
long-term psychotics were in psychotherapy 
and none in the control. This finding sug- 
gests that long-term psychotics in intensive 


TABLE 33 


COMPARISON OF PATIENTS 


уно RECEIVED TRANQUILIZING MEDICATION IN LEAST AND Most 
ADAPTIVE FOLLOW-UP CATEGORIES 


Diagnoses 
N Ps Pl 
Least Most Least Most Least 
Ro Adaptive Adaptive Adaptive Adaptive Adaptive 
6 6 Drugs 11 7 Drugs 11 10 
Drugs й 18 NoDrugs 20 12 NoDrugs 18 14 
No Drugs х? = .03 x2 = 45 


х? = .30 
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interpersonal situations change for better or 
worse while avoiding such situations results 
in stabilization at a marginal adjustment 
level. 

The correlations among follow-up items 
are, in general, positive and significant. Yet 
it seems quite clear that the correlations 
are not of sufficient magnitude to warrant 
the assumption of a single criterion. Thus, 
the outcome of any treatment program is 
a function of the follow-up criterion chosen, 
which is based upon a value judgment. For 
instance, one would arrive at different con- 
clusions concerning the efficacy of treatment 
programs in the current study if one arbi- 
trarily decided that the criterion of hos- 
pitalization was more important than the 
criterion of employment. 

But further complications in the criteria 
problem are revealed in the correlation 
matrix shown in Table 28. Within-hospital 
measures, with the exception of the group 
therapy rating, are not significantly related 
to follow-up. Thus, ward behavior, long 
an accepted criterion in treatment evalua- 
tion, is not significantly related to the three 
follow-up items chosen and performance in 
the hospital work situation is not signifi- 
cantly related to any other measure. Tests 
form a cluster of interrelated scales which 
are, by and large, not significantly related 
to other criteria. The relationships between 
follow-up criteria and the Group Therapy 
Rating Scale suggests that observations by 
trained personnel of intensive interpersonal 
Situations within the hospital may be the 
best predictors of follow-up adjustment. 

The fact that there are two relatively 
independent criteria clusters, (a) subjec- 
tive self-evaluation as revealed in tests and 
(b) objective interpersonal evaluation as 
revealed in follow-up items and group 
therapy rating, caution against generalizing 
from test measures to within-hospital be- 
havior or follow-up behavior. Implications 
of these relationships for treatment evalua- 
tion studies using a single criterion are 
obvious. 


SuMMARY 


This investigation attempted to evaluate 
the relative efficiency of four psychothera- 


peutic programs with three patient groups 
on a number of different criteria. All pa- 
tients in the study lived on the same experi- 
mental ward. The four treatments were: 
(a) group living, in which all activities 
were group oriented, i.c, work, living ar- 
rangements, psychotherapy, etc., (b) group 
therapy and an individual work assignment, 
(c) individual therapy and an individual 
work assignment, and (d) a control group 
with an individual work assignment and 
no therapy. The three patient groups were: 
(а) a nonpsychotic group, (b) а short- 
term psychotic group, and (c) a long-term 
psychotic group. The design was a three 
by four factorial type analysis of variance 
with an N of eight in each cell and a total 4 
of 96. All patients were tested and rate 
upon entry into the program and upon 
departure. The instruments were: (a 
Ward Behavior, (b) MMPI, (c) ТАТ, 
(d) Holland Vocational Preference In- 
ventory, (e) Q sort, and (f) a six-month 
follow-up questionnaire, In addition, а 
patients in group therapy were rated every 
30 days on F'inney's (1954) Therapy Scale 
and 70 of the 96 were rated on job Рё 
formance, both upon entry and termination 
of the program. 


The following results obtained : 

1. The number of days in treatment Жан 
significantly different for both treatme? 
and diagnostic groups. The individua 
therapy group took the greatest time, group 
living next, with group therapy and com 
trol the least and almost equivalent time’: 
Diagnostically, long-term psychotics were 1 
treatment longest, with short-term PSY 
chotics and nonpsychotics requiring the least 
and almost equivalent times. 

2. Change score comparisons (initial 
terminal test scores) were computed fof 
variances and medians, Generally, signif 
cantly different variances occurred on m0? 
instruments. between treatments and diag” 
noses. The control group usually had the 
smallest variance and the three psycho" 
therapy groups the largest. Of the Tum 
treatments, group living or individu? 
therapy usually showed the greatest van 
ance. Variances were usually significant 
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different between diagnoses with the long- 
term psychotics having the largest vari- 
ances, followed in descending order by the 
short-term and nonpsychotic patients. 

3. Tests of significance between change 
score medians revealed only the Ma scale 
of the MMPI yielding a significant differ- 
ence between treatments. However, nine 
interactions were significant, and six of 
these followed the same pattern: nonpsy- 
chotics and short-term psychotics in the 
three groups receiving psychotherapy 
showed moderate to large adaptive change 
while long-term psychotics in the same three 
groups showed moderate to large change 
in a maladaptive scale direction. On the 
other hand, nonpsychotics and short-term 
psychotics in the control group showed rela- 
tively little change, and long-term psychotics 
in the same program showed moderate to 
large adaptive scale score change. Four 
scales showed significantly different median 
change between the diagnostic groups. On 
all of these scales the nonpsychotic and 
short-term psychotic groups changed more 
adaptively than the long-term psychotics. 

4. Correlations between the nine follow- 
up items revealed three that were so highly 
intercorrelated as to be considered a single 
criterion. Other items were, In general, 
significantly correlated but not of sufficient 
magnitude to warrant criteria substitution. 


Tests of significance between treatments 


revealed significant differences in employ- 
ment, with the three groups which had re- 
ceived psychotherapy having a higher 
percent of full-time employed than the con- 
trol The other eight follow-up items 
revealed no significant differences between 
treatments. When compared by diagnoses, 
six of the nine follow-up items significantly 
differentiated the groups. Overall, the 
short-term psychotics made the most ade- 
quate posthospital adjustments, nonpsy- 
chotics next, and the long-term psychotics 
the poorest. 

5. Thirteen frequently used scales were 
selected for correlational analysis. They 
were: three follow-up items, (Rehospitali- 
zation, Employment, Degree of Illness), 
three within-hospital behavior measures, 
(Ward Behavior, Job Performance, and 
Group Therapy Behavior), TAT, Q sort, 
and Sc, Es, A, R, and SIM from the 
MMPI. Two relatively independent clusters 
of interrelated measures appeared : (a) three 
follow-up items and group therapy behavior 
and (b) MMPI scales of Sc, Es, А, SIM, 
О sort, and TAT as a marginal member. 

6. Other comparisons revealed that 
*global" follow-up adjustment was unre- 
lated to time in treatment or receipt of 
tranquilizing drugs during treatment. 

The results were discussed in relation to 
treatment programs, psychotherapy criteria, 
and further research. 
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EXPERIMENTAL STUDIES IN THE TRAINING OF ORIGINALITY? 
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ГЕ difficulty in attempting to fa- 
cilitate original thinking is that it may 
not occur at all or at such infrequent intervals 
that reinforcements cannot be administered 
with sufficient frequency to effect an in- 
crease in such behavior. Thus, a funda- 
mental problem in the training of originality 
is to devise methods for increasing its oc- 
‘currence in the first place, thereby per- 
mitting the operation of reinforcement. We 
are assuming that originality can be learned 
and that the same principles of condition- 
ing hold as in other forms of operant be- 
havior. Some of the problems attendant 
upon this assumption have been discussed 
elsewhere (Maltzman, 1960). For our pres- 
ent purposes we need indicate only that by 

| originality, or original thinking, we mean 
behavior that occurs relatively infrequently, 
is uncommon under given conditions, and 
is relevant to those conditions. In order, 
then, to facilitate the occurrence of original 
behavior, techniques must be devised. for 
evoking many uncommon responses. Such 
training may then produce a disposition to 
give uncommon responses in other situa- 
tions. 

One procedure, used by  Maltzman. 
Bogartz, and Breger (1958) is to repeatedly 
evoke different associations to the same 
stimulus words in a free association situa- 
tion, This procedure prompts the subject 
to emit responses relatively low їп the 
response hierarchy elicitable by each stimu- 
lus, They found that subjects receiving this 


i ү ү = Nonr 

1 This research was Suppor З 
233050) гуй the Office of aval Research and 
‚ the "University of California. Reproduction in 
Whole or in part is permitted for any purpose o 


he United States Government. g 
am Bogartz, Nira 


2 на 
„° We аге indebted to William | д: NUS 
"Kozak, and Bonnie Simon for their assistance in 
Various stages of this research. 


i 


training with intermittent reinforcement of 
uncommon responses were significantly 
more original on a new list of words than 
subjects without this training. A second 
experimental group which did not receive 
verbal reinforcement approached signifi- 
cance in comparison with the control group. 
When instructions to be original were added 
to the experimental and control treatments 
they showed a marked increase in the 
originality of their associations, and both 
experimental groups were now significantly 
more original than the control group. The 
Unusual Uses Test of originality devised 
by Guilford and his associates (1950) was 
administered following the free association 
test. Equivocal evidence for transfer of the 
association training effects to this situation 
was obtained in the form of a significant 
triple order interaction. 

The essential features of the procedure 
responsible for the facilitation of original 
responses to new stimuli, however, cannot 
be determined from that study. The pur- 
pose of the present series of experiments, 
in part, is to determine what those relevant 


features are. 


EXPERIMENT I 


It may be that the important character- 
istic of the procedure employed by Maltz- 
man, Bogartz, and Breger (1958) is not 
evocation of responses relatively low in 
the hierarchv of each stimulus word, as 
was assumed, but simply the production 
of many different responses. The purpose 
of this first experiment, therefore, was to 
determine whether facilitation of uncommon 
responses to new stimuli would occur when 
different responses were evoked by the pres- 
entation of different stimuli rather than by 
repetition of the same stimuli. Two dit- 
ferent lists of stimulus words were employed 
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for this purpose. One list consisted of 
relatively common and the other relatively 
uncommon words, in order to determine 
whether the frequency of usage of the 
words would have a differential effect upon 
the uncommonness of test responses. 
Another control group not employed in the 
previous study (Maltzman, Bogartz, & 
Breger, 1958) was added to this experiment. 
This group responded in the same fashion 
to repeated presentations of a given stimu- 
lus word. Such a condition would enable 
us to determine whether the number of re- 
sponses per se, or the number of different 
responses, is the relevant variable influenc- 
ing test performance. 


Method 


Subjects. The subjects (Ss) were 292 students 
drawn from introductory psychology classes. 

Stimulus materials, The stimulus words used in 
the initial list and the test list for all groups were 
selected from the norms obtained by Wilson 
(1942), and are the same as those used in a 
previous study (Maltzman, Bogartz, & Breger, 
1958). Evocation of a relatively small number 
of different responses in a free association situa- 
tion was the criterion for the selection of these 
stimulus words. The different training words pre- 
sented to two of the groups following the initial 
list were selected from the Thorndike-Lorge count 
(1944). The list of relatively common words were 
chosen from among the 500 most frequently occur- 
ting words in the count. The list of relatively 
uncommon words were selected from words oc- 
curring not more than 6 times per 1,000,000. 
Guilford's Unusual Uses Test (1950) was ad- 
ministered after the completion of the free asso- 
ciation test list. 

Procedure. The Ss were treated in identical 
fashion on the initial presentation of the free 
association training list, the first 25 stimulus words 
for all the groups. At the start of the experiment 
the Ss received the usual free association instruc- 
tions to respond as quickly as possible with the 
first word that came to mind. 

After completion of the initial 25-word training 
list one control group (C) was given the test list 
of 25 new stimulus words. The previous instruc- 
tions were repeated before presentation of the test 
list, as they were for all the other groups. A 
second control group (Cr) received five additional 
presentations of the same training list with instruc- 
tions to try to give the same response to a given 
stimulus word each time. They received the test 
list following the last repetition of the initial 
training list. 

Two experimental groups received a single pre- 
sentation of the initial 25-word training list fol- 


lowed immediately by 125 different words. One 
group (Xz) received a list of words with а low 
frequency count, while the other group (2 
received words with a high count, They were given. 
the test list following the completion of the entire 
training list. A 
After completion of the training list a. third 
experimental group (Х) received five addition 
presentations of the list with instructions to тера 4 
as quickly as possible to each word, but to giv he 
different response than the one used before. ate 
test list was administered following the last re? 
tion of the training list. 
All groups received 
immediately after the completion of the fri 
ciation test list. А form 
The experiment was conducted in group ining 
Stimulus words for the free association or Ws 
and test were read aloud by E, and S recor fi 
responses in a booklet provided. Each page & js 
booklet had 25 numbered blank spaces 4 ed 
responses. А stimulus word was presente! Ws 
5 sec. during the initial training list and Е отв 
list for all groups. The additional training nt the 
presented groups Хи and Xz, were presented ^ sist 
same rate. Repetitions of the initial training 1 


est 
al- Uses T 
the Unusu 25 asst” 


words for groups X and Cr were presente 
sec. intervals. 


responses occurred to each stimulus , tiot 
was determined for the free associ er? 
training and test lists. Ss’ responses M 
then scored on the basis of the O tat an 
frequencies, and each 5 was assign? ге 
originality score which was the mee ere 
quency of his responses. А low score 

fore represents high originality. 4 fev 

failed to respond to a stimulus 


Results t 
The frequency with which the differ 
of 
ye 
recorded an illegible response, but no 5 et 
less than 24 readable responses. Я 
was based on the number of legible 
Sponses recorded. or 
Table 1 shows the mean originality S arst 
obtained by the various groups on the est: 
Presentation of the training list, the P nal 
A simple analysis of variance show? pe 
they did not differ significantly prior 1o nt 
introduction of the different treat yer 
(F =2.22; df = 4,287 ; P > 05). Но? 
neity of variance was indicated РУ 
Hartley test. of* 
Table 2 shows the mean originality NT 
obtained by the various groups on t * aff 
association test list. An analysis of C? we 
ance of the free association test score? | 
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TABLE 1 


MEAN PRETEST FREE ASSOCIATION ORIGINALITY 
Scores IN Exe. I 


Cond. N Mean SD 
G 64 91.69 19.74 
Cr 62 92.70 16.42 
x 60 86.39 25.89 
Хи 60 97.06 19.35 
Хх, 46 95.89 20.64 


planned in order to increase the precision 
of the experiment. However, а test of the 
homogeneity of regression of the test on 
the training scores indicated that this as- 
sumption of the analysis could not be met 
(Р < .005). 

А simple analysis of variance of perform- 
ance on the free association originality test 
was therefore conducted. An F of 1495 
was obtained (df = 4287; P < .001). The 
results of t tests showed that the two con- 
trol groups did not differ significantly from 
each other and the three experimental 
groups did not differ significantly. How- 
ever, each of the three experimental groups 
was significantly more original than each 
of the control groups, P < .001. 

Originality scores for performance on the 
Unusual Uses Test were obtained for each 
S by a procedure somewhat similar to the 
one employed for the free association test. 
The frequency with which different uses 
occurred for each of the six different ob- 
jects mentioned in the test was first deter- 
mined. In the previous study by Maltzman, 
Bogartz, and Breger (1958) an originality 
score was obtained for each S by counting 
the number of responses he gave to each 


TABLE 2 


ту SCORES ON FREE ASSOCIATION 


;AN ORIGINALI 
Misi Test IN Exe. I 


test item that occurred less than 10% of 
the time in the obtained sample. This pro- 
cedure was not feasible in the present study 
which employed more than twice the number 
of Ss, since unique responses typically com- 
prised more than 10% of the responses to 
each item. The originality score for each S 
therefore was taken as his total number of 
unique responses. Any response was de- 
fined as unique if it occurred only once in 
the sample to a given test item. A second 
score obtained for each S was his total 
number of nonunique or common responses. 
This may Ье taken as a measure of fluency. 

Table 3 shows the mean number of 
unique responses obtained by the various 
groups on the Unusual Uses Test. The re- 
sults from one S in Group Cr were dis- 
carded, because he incorrectly recorded his 
uses in the free association test booklet 
rather than the booklet provided for the 
Unusual Uses Test. Since Hartley's test 
indicated that the variances of these scores 
were heterogeneous (P < .01) and the dis- 
tributions were highly skewed, it was de- 
cided to adopt the .01 level for rejection 
of the null hypothesis. An analysis of 
covariance was not conducted because of the 
absence of a significant correlation between 
unique responses and originality on the free 
association pretest. 

А simple analysis of variance of the 
originality measure obtained from the Un- 
usual Uses Test gave an F of 1874 (df — 
4,286; P < .001). Since the variances were 
significantly different, half the appropriate 
degrees of freedom were used in the t 
tests of the differences between groups. 
Results of these tests showed that Group 
X gave significantly more unique responses 


TABLE 3 


MEAN NUMBER OF UNIQUE USES IN Exe. І 


Cond. Mean SD Cond. Mean SD 
оп! 

23.18 c 2.58 2.30 

c 10-30 20.99 Ce 1.03 1.06 

Cr “19 18.42 x 4.75 3.54 

x $3.40 24.57 Xu 2.21 2.08 

: x 2.72 е 
s 56.44 18.64 X, 7 2.36 
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TABLE 4 


Mean NUMBER оғ Common Uses IN Exp. І 


Cond. Mean SD 
€ 14.44 4.84 
Cr 14.59 4.58 
X 15.65 5.09 
Xn 15.42 4.18 
XL 15.62 4.34 


than any of the other groups, P < .001. 
Groups Хи, Xi, and C did not differ 
significantly. However, Groups C and 
X, were significantly more original than 
Group Cz at the .001 level. Group Хи was 
significantly more original that Cpr at the .01 
level. A rank order analysis of variance 
yielded comparable results (H = 52.00; df = 
4; P <.001). 

Table 4 shows the mean number of non- 
unique or common responses obtained by 
the different groups on the Unusual Uses 
Test. Hartley's test indicated that the vari- 
ances were homogeneous. A simple analysis 
of variance yielded an F of .92, indicating 
that the groups did not differ significantly 
in terms of their fluency of common re- 
sponses on the Unusual Uses Test. Analy- 
sis of covariance was again contraindicated 
by the absence of a significant correlation 
between frequency of common uses and 
originality on the free association pretest. 

A basic assumption in the use of the 
training procedure employed with Group X 
is that it would increase the uncommonness 
of the responses to the training words. Fig. 
1 shows the mean percentage of unique 
responses occurring on successive repeti- 
tions of the training list. It was necessary 
to employ the percentage of unique re- 
sponses rather than the mean or median 
weighted frequency scores, because of the 
highly skewed distributions of scores ob- 
tained for the 25 training words. By the 
third repetition over half the responses of 
the average S were unique. It is clear from 
Fig. 1 that the training method employed 
with Group X had the desired effect of 
inducing an increase in the uncommonness 

of the responses emitted. 


Group C and Group X in the present 
experiment are similar to conditions em- 
ployed in the study by Maltzman, Bogartz, 
and Breger (1958). In the latter experi- 
ment the difference in originality between 
these two groups only approached signifi- 
cance on the free association test, but was 
significant when combined with verbal re- . 
inforcement or instructions to be original. 
In the present experiment Group X showed 
a highly significant increase in free associa- 
tion originality in the absence of verbal re- 
inforcement or differential instructions to 
be original. In the previous experiment only 
equivocal evidence of facilitation on the 
Unusual Uses Test was obtained. Here, the 
training administered Group X in the free 
association situation produced a highly s18- 
nificant facilitation of originality оп the 
Unusual Uses Test. 

The different results obtained in the two 
experiments may only be due to the increase 
in the size of the sample in the present 
experiment. However, it is possible that 
the change from an individual to a grouP 
experiment also contributed to the difference 
in results. Conducting a free association 
experiment in group form grants a degree 
of anonymity to S’s responses which is not 
the case when he is run on an individua 
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basis. In the group experiment there also 
are necessarily more different cues present 
that are potentially the occasion for many 
responses less likely to occur when S is 
alone with E in a bare room. Both of these 
characteristics may have contributed to the 
appearance of a training effect in this ex- 
periment of a greater magnitude than the 
one reported in the earlier study (Maltz- 
man, Bogartz, & Breger, 1958). In any case 
the magnitude of the training effect obtained 
in this study is consistent with the results of 
the subsequent experiments reported here. 
Groups Хи and Хх, were employed in 
order to determine whether evoking many 
different responses would produce the same 
effect on test performance as the training 
provided Group X. The assumption was 
that the occurrence of any given response 
increases the disposition of responses asso- 
ciated with it to occur, Evidence of such 
an effect has been reported by Judson and 
Cofer (1956) and by Judson, Cofer, and 
Gelfand (1956). Providing for the occur- 
rence of many different responses during 
free association training should therefore 
increase the reaction potential of many more 
different responses on the test list as com- 
pared with the control groups. That an 
effect of this sort may be operating is sug- 
gested by the results of the free association 
test where the three experimental groups 
do not differ significantly, but are signfi- 
cantly superior to the control groups. 
However, the results of the Unusual Uses 
Test in terms of originality scores indicates 
that the training administered Group X 
involves more than the arousal of many 
intraverbal associations. On this test, Group 
X gave significantly more unique responses 
than Groups Хи and Ху, as well as the two 
control groups. These results indicate that 
the procedure of repeated evocation of dif- 
ferent responses to the same stimulus words 
produces a more general dap ps pro- 
duce uncommon responses than the pro- 
cedure of evoking different responses to 
different stimuli. The effect of this арчы 
furthermore, was restricted to increasing the 
number of uncommon uses. There were no 
Significant differences among the groups їп 
terms of their fluency of common responses. 


The significantly poorer performance on 
the free association originality test of Group 
Cr than the three experimental groups indi- 
cates that the increased originality of the 
experimental groups on this test cannot 
be attributed to the number of responses 
evoked per se or to habituation factors. The 
three experimental groups and Group Cr 
emitted the same number of responses dur- 
ing training. The difference obtained must 
be due to the uncommonness of responses 
per se emitted by the experimental groups 
or the uncommonness of responses emitted 
to the given stimulus words. 

The fact that Group Cr gave significantly 
fewer unique uses than Group C indicates 
that the training in the free association 
situation of repeatedly evoking the same 
response to a given stimulus interferes with 
the tendency to give uncommon uses. To 
some extent, the repeated occurrence of 
common responses to the same stimuli in- 
creases the tendency to emit common re- 
sponses in other situations, just as the 
repeated occurrence of uncommon responses 
to the same stimuli increases the disposition 
for original behavior in other situations. 


ExPERIMENT II 


The purpose of this next experiment was 
to further explore possible relevant vari- 
ables and alternative methods of facilitating 
originality. Two groups of Ss were ex- 
posed to uncommon associations which they 
simply read, in order to determine whether 
emission of uncommon responses by S is 
a necessary condition of the originality 
training procedure. In an attempt to explore 
the extent to which the method of facilitat- 
ing originality is peculiar to the training 
materials employed, a group of Ss was 
given repeated presentations of items from 
the Unusual Uses Test and tested with the 
free association materials. Conditions X 
and C of the previous experiment, which 
we shall hereafter call the standard ex- 
perimental training and control conditions, 
were also employed in this experiment. 


Method 
Subjects. The Ss were 251 students drawn from 
introductory psychology classes. 
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Stimulus materials. The stimulus words used in 
the initial training and subsequent test lists are 
the same as those in Experiment I. However, their 
function was reversed in the present experiment. 
The list previously employed as the training list 
was used as the test list in this experiment, and 
the test list in the previous experiment was used 
as the training list. Guilford's Unusual Uses Test 
was administered after the completion of the free 
association test list to all groups except Х.. 

Procedure. All Ss were presented the initial 
training list and the final test list of stimulus words 
in the same manner, including the usual free 
association instructions to respond as quickly as 
possible with the first word that came to mind. 

The standard control group (C) received only 
a single presentation of the training list followed 
by the test list of words. The standard experi- 
mental condition (Xi), as in the previous study, 
received five additional presentations of the train- 
ing list with instructions to respond as quickly as 
Dossible, but to give a different response than the 
one used before to each stimulus. 

А second experimental group (X:) was also 
asked to respond differently to repeated presenta- 
tions. However, the stimuli employed were the 
Six items from the Unusual Uses Test. The Ss 
were instructed to write a different use following 
the oral presentations of cach item. Five repeti- 
tions of the items were administered. A third 
experimental group (Хз) was presented with a 
booklet of 125 pairs of items. The Ss in this 
condition were instructed to underline the member 
of each pair which goes more readily with the 
stimulus word orally presented to them. The 
stimulus words were five repetitions of the train- 
ing list. Response pairs were unique responses to 
these words culled from the results of Exp, I. 

The final experimental group, Х., received the 
Same response pairs as Group Xs, but without the 
Preceding stimulus words. These Ss were in- 
structed to underline the member of each pair that 
they thought to be more familiar. 

Stimulus words for the free association training 
and test lists were read aloud by E, and $ recorded 
his responses in a booklet provided by E. The Un- 
usual Uses Test was presented in written form 
to all groups except Group Xs. Stimulus words 
were presented every 5 sec, during the initial free 
association training and test. Repeated presenta- 
tions of the stimuli for originality training oc- 
curred at 10 sec. intervals, Fifteen Sec. were per- 
mitted for the response to each unusual uses train- 
ing item in Group Х.. 


Results 


Frequency distributions of the responses 
to each of the training words in Exp. I 
were employed as norms for scoring the 
responses to these words which were em- 
ployed as the test list in the present study. 


TABLE 5 


SSOCIATION ORIGINALITY 


MEAN PRETEST FREE 


SCORES ІХ 
Cond. N Mean SD 
Xi 64 87.72 en 
¥ 46 83.56 a 
33 48 87.85 18. pa 
Xi 39 85.66 18. н 
C 54 83.48 23.1 


The frequency with which the different j 
responses occurred to each of the nal | 
words in the present experiment was Е 
termined, and S's’ responses were scored 0 
the basis of these frequencies. : d 
Table 5 shows the mean originality зей 
obtained by the various groups on the "d 
presentation of the training list. А —- 
analysis of variance indicated that s 
groups did not differ significantly prior 3 
the introduction of the experimental pee 
ments (Р=.55). Homogeneity of varian? 
was indicated by the Hartley test. E 
Table 6 shows the mean originality sco? 
obtained by the various groups on the А 
association test. Appropriate tests indicat 
that the variances of the test scores wA 
homogeneous as well as the regression a 
test scores on training scores. The гете, 
sion of the test scores on the initial наш! 
Scores was .69. as 
A simple analysis of covariance Vit 
therefore conducted in which the origina” 
test scores were adjusted for initial di 
ences on the training list. An F of s 
was obtained (df = 4, 245; p < .001). e 
results of ¢ tests indicated that Group 


TABLE 6 


50° 
UNADJUSTED AND ADJUSTED (Мвлм) Free AS 
CIATION TEST ORIGINALITY SCORES IN 


Exp. II 

Cond Mean SD Mea? 
13 
X 55.09 20.44 53-4 
X: 80.02 25.88 81.48 
X 87.82 19.58 807 
X, 84.37 21.90 82.2 

C 80.67 24.08 g. 


EXPERIMENTAL STUDIES IN THE TRAINING OF ORIGINALITY 7 


TABLE 7 


MEAN NUMBER OF TRANSFORMED (VX + .5) 
Unique Uses IN Exp. П 


Cond. Mean SD 
Xi 1.65 .56 
Xs 1.35 .58 
X; 1.23 .54 
© 1.17 .43 


differed significantly from each of the other 
groups (Р «.001), whereas these four 
groups did not differ significantly from 
each other. 

The frequency with which different uses 
were offered for each item in the Unusual 
Uses Test was determined from the norms 
of Exp. I, and an originality score assigned 
cach 5 based on the number of unique 
responses he submitted. A second score 
obtained for each S was his total number 
of nonunique or common responses. Since 
the score distributions for unique responses 
were skewed, and the variances were hetero- 
geneous, as in the previous study, the trans- 
formation VX + .5 was applied to them. 

Table 7 shows the mean number of trans- 
formed unique responses obtained by the 
various groups receiving the Unusual Uses 
Test. The variances of the transformed 
scores were homogeneous as well as the 
regression of unique responses on the free 
association pretraining scores. An analysis 
of covariance of the unique responses re- 
gressed on the pretraining scores yielded an 
F of 10.31 (df =3, 207; P < .001). How- 
ever, since the regression was only .02 no 
increase in precision over a simple analysis 
of variance was obtained, and the adjusted 
means which differ only slightly from the 
unadjusted means will not be presented.? 
Individual comparisons among the groups 


s 
3 The statistical analyses of the results from this 


experiment were performed on » e ve ex 
i Y ts of the analysis o 
ironic computer. The resul f 
covariance are reported even though y це 
sentially the same as from an analy s о ec 
since the computer program was for oea зд 
and the computations of the regressions we 


tained at the same time. 


by means of i tests showed that the standard 
experimental training condition, Group Xi, 
was significantly more original than each of 
the other conditions (P « .001). The other 
conditions did not differ significantly from 
each other. 

Since the distributions of scores were 
highly skewed due to the large number of 
failures to give unique responses, the trans- 
formation could not normalize the data. A 
rank order analysis of variance was there- 
fore conducted. Results comparable to the 
previous analysis were obtained. The Н 
was 29.82 (df —3; P < .001). 

Table 8 shows the mean number of com- 
mon uses obtained by the various groups 
on the Unusual Uses Test. The variances 
were homogeneous as well as the regression 
of common responses on the free associa- 
tion pretraining scores. An analysis of 
covariance of the common responses re- 
gressed on the pretraining scores yielded 
ап F of 541 (df=3, 207; P « .001). 
However, since the regression was only 
— .02 no increase in precision over a simple 
analysis of variance was obtained. In- 
dividual comparisons among the conditions 
by means of f tests showed that the three 
experimental groups did not differ signifi- 
cantly from each other. However, each of 
these three groups gave significantly more 
common responses than the standard con- 
trol group (Р < .05). 

Results obtained in this experiment for 
Condition X, confirm the results of Exp. I 
that repeated evocation of different re- 
sponses to free association stimuli is fol- 
lowed by an increase in the originality of 
responses to new stimulus words and to 
items on an Unusual Uses Test as compared 
to a control group (C) in which such train- 


TABLE 8 


Mean NUMBER or Common Uses IN Exr. IT 


Cond. Mean SD 
Xi 20.09 5.17 
Xs 19.26 5.91 
Xa 21.08 5.63 
c 17.03 5.18 
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ing is omitted. Conditions X, and X, were 
introduced in order to determine whether 
the uncommon responses must be evoked 
by the stimulus words in order for the 
training to be effective. In the former con- 
dition uncommon responses were evoked as 
textual responses (Skinner, 1957) following 
the stimulus words, whereas in Condition 
X, only the textual responses occurred. 
Neither condition was followed by signifi- 
cant increases in originality on the tests 
employed as compared with Group C, and 
they obtained significantly lower scores than 
the standard experimental condition, Group 

x These results show that with the stimu- 
lus materials employed, uncommon re- 
sponses must be evoked as intraverbal re- 
Sponses by the stimuli if there is to be an 
increase in originality in the test situations. 

It may be argued perhaps that the asso- 
ciations provided Groups X, and X, are 
not uncommon. Although this is a possi- 
bility it seems rather unlikely, since each 
association occurred only once in a similar 
sample of approximately the same number 
Of Ss. Furthermore, since a pair of un- 
common responses was given for each 
stimulus word or ordinal position, twice as 
тапу uncommon responses were available 
to Groups X, and X, than to Xi, Thus, 
even though some of these associations may 
have been common to the former groups 
they still should have received at least as 
many uncommon associations as Ss in 
Group X,. 

Another hypothesis which may be sug- 
gested to account for the difference among 
the experimental conditions is that the pro- 
cedure employed with Group X, induces 
the inhibition of common responses as the 
result of repeated presentations of the same 
stimulus words and the accompanying in- 
structions, thereby permitting the appear- 
ance of uncommon responses, Thus, the 
facilitation of uncommon responses in the 
test situations occurs at least in part be- 
cause of a disposition to inhibit common re- 
sponses as well as an increase in the ex- 
citatory potential of uncommon responses, 
Conditions Хз and X, do not provide for 
the inhibition of common responses, but 


evoke the uncommon responses directly 
as textual responses. ‹ 
Evidence apparently contradicting this in- 
terpretation stems from another study 
(Maltzman, Brooks, Bogartz, & Summers 
1958) which found that uncommon uses 
occurring as textual responses may facilitate 
performance on the Maier two-string 
problem. Examination of the latter к. 
periment suggests, however, that the resu 1 
obtained there are not necessarily in СОП 
flict with the inhibition interpretation. M 
the latter experiment the uncommon textua 
responses were evoked in the presence АН 
stimulus objects subsequently found in t7 
two-string problem. The excitatory po 
tial of uncommon responses to the Г. 
stimuli could therefore be increased by the 
training procedure employed. But in e 
present experiments the training and £2 
stimuli were different. This suggests tha ^s 
training method evoking uncommon we 
verbal responses will produce a more В 
eral effect than training with uncom 
textual responses, and implies further t d 
if uncommon uses for objects not four” 
in the two-string problem were to be wos 
ployed, facilitation of problem solutio 
would not occur. О 
Although Conditions X, and X, did P 
show a significant increase in originality 
the two test situations, they did yie se 
significant increase in common resp” o 
on the Unusual Uses Test, an effect и 
obtained in Condition X,. This finding y | 
gests that uncommon responses, whe h 
textual or intraverbal, tend to facilitat? «e | 
occurrence of other responses, to facili ot 
fluency. However, since this result Wa de 
anticipated, the experiment was not 10 
signed with an adequate control grou? of 
determine whether an increase in СОП, 
uses is peculiar to the evocation of un^ ner 
mon responses during training or whe if 
such an increase would occur as WÊ ей. 
common responses were initially eV?” jp 
It should be noted that Condition ^” se 
Exp. I did not show a significant inc" the 
in common responses as compared p 
control group, while highly reliable inc” pi 
in original responses were obtained. Та 
discrepancy in the results of the two €% 
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ments suggests that the present findings of 
increases in common responses should be 
accepted with caution. The results of sub- 
sequent experiments are equally incon- 
Sistent. 


EXPERIMENT ПІ 


The purpose of this experiment was to 
determine whether Group X. in Exp. П 
failed to show a significant training effect 
due to the relatively small amount of train- 
ing it received, five repetitions of the six 
items from the Unusual Uses Test. In 
contrast, the standard experimental con- 
dition received 25 stimulus items repeated 
five times. Before it can be concluded that 
unusual uses items cannot be used as train- 
ing materials for the facilitation of original- 
ity, the effects of more extensive training 
with these materials must be explored. 
However, amount of training can be varied 
in two ways, by increasing the number of 
itéms, and by increasing the number of 
repetitions of the items. Both variations 
were employed in this experiment. 


Method 


Subjects. The Ss were 162 students drawn from 
introductory psychology classes. ——. 

Stimulus materials, The same initial and final 
free association lists were employed as in Exp. П. 
Items from the Unusual Uses Test previously em- 
ployed were also used in this experiment. „Six 
additional items were used in two of the conditions. 
These items were, a brick, a pair of scissors, a 
pencil, a five-cent piece, a light bulb, and a bar of 
soap. 
Procedure. Four different groups of Ss were 
employed similar to Group X: in Exp. П. They 
all received a single presentation of the free asso- 
ciation training and test lists. Group A received 
10 repetitions of the six items while Group (on 
received 20 repetitions of these items. Group B 


TABLE 9 


т FREE ASSOCIATION ORIGINALITY 


E RETES 
Mean P Scores IN Exp. Ш 


Сопа N Mean SD 
uso s r$ m 
C 6 x 20) 38 82.05 ay 
В (12 X 5) 43 78.75 1 
D(12 x 10) 38 84.71 23.09 


TABLE 10 
MEAN FREE ASSOCIATION TEST ORIGINALITY SCORES 
IN Exp, Ш 
Cond. Mean SD 
А ( 6 X 10) 65.39 28.27 
B(2X 5) 68.27 21.23 
Ci( 6 X 20) 79.17 24.20 
D (12 X 10) 77.22 29.32 


received five repetitions of 12 items and Group D 
received 10 repetitions of these 12 items prior ta 
presentation of the free association test. 


Results 


Performance on the free association 
training and test lists was analyzed and 
scored in the same fashion as in the pre- 
vious experiments. Table 9 shows the mean 
originality scores obtained by the four 
groups on the training list. A two-way 
analysis of variance showed that the column 
means, 60 vs. 120 responses, differed sig- 
nificantly prior to the introduction of the 
experimental treatments (F=5.21, df= 
1, 158; Р < .05). Factors responsible for 
this initial bias are not at all apparent. 
Table 10 shows the unadjusted mean origi- 
nality scores obtained by the four conditions 
of this experiment on the free association 
test list. Appropriate tests showed that the 
variances of the training and test means 
were homogeneous. A test of homogeneity 
of regression of the test scores on training 
scores, however, was significant (F — 46; 
df =3, 120; P = .01), thus precluding an 
analysis of covariance. 

Fortunately, however, the assumptions of 
analysis of covariance were met in the com- 
parison of primary interest. 

Comparison of the two conditions in this 
experiment given six items with 10 and 20 
repetitions, conditions A and Ci, respec- 
tively, with conditions X, and C of the 
previous study would indicate whether the 
number of repetitions has a significant 
facilitating effect upon free association 
originality. Table 11 presents the adjusted 
free association test means of these four 
groups. Their pretraining mean originality 
scores do not differ significantly. Their 
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TABLE 11 


UNADJUSTED AND ADJUSTED (MEAN’) ORIGINALITY 
Test Scores FOLLOWING DIFFERENT AMOUNTS 
OF TRAINING IN Exes. II AND III 


Cond Mean Mean" 
€ 80.66 78.81 
X«(6 X 5) 84.36 81.07 
A (6 X 10) 65.39 72.03 
Ci(6 X 20) 79.17 77.67 


variances and regression on pretraining 
Scores likewise were homogeneous, The re- 
gression was .66. A simple analysis of 
covariance gave an F of 1.62 (df = 3, 169; 
P > 05). Thus, no evidence of a facilitat- 
ing effect of training with unusual uses was 
obtained. 

Results obtained with Group X. of Exp. 
IT and the four groups of Exp. III indicate 
that the method of facilitating original re- 
Sponses by repeated evocation of different 
responses to the same stimuli does not work 
with all materials. 

А basic assumption of the approach em- 
ployed here is that if original behavior can 
be induced to occur repeatedly through 
Some training procedure, it will be more 
likely to occur in new situations. The train- 
ing method employed, therefore, must evoke 
тапу uncommon responses. If this condi- 
tion is not met, transfer of originality to 
new stimulus situations cannot occur. It 
has previously been determined that re- 
peated evocation of different responses to 
free association materials does lead to in- 
creased originality on successive training 
trials. 

The mean percentage of unique uses 
responses occurring on successive repetitions 
of the training list was therefore determined 
for Group A. Fig. 2 shows the results of 
analyzing the training data, and suggests 
that a reliable increase in unique responses 
may have occurred during training. A 
trend analysis was therefore conducted. Tt 
yielded a significant trials effect (F = 2.60; 
df =9, 378; P —.01) and Ss’ effect (Е= 
4.24; df = 42, 378; P < 001). Thus, а sig- 
nificant increase in the percentage of unique 
uses occurred with successive repetitions of 


the training list. However, a comparison of 
the effects of training with free association 
stimulus words and uses, Figs. 1 and 2, 
indicates that the former produced consider- 
ably more unique responses. The i 
percentage of unique responses is ood 
in the former case after a single repetitio! 
of the list of 25 words than the latter * 
10 repetitions of the 6-item list. Bu 
probably due to the relatively low ne 
age of unique uses on the first presentat x 
of the unusual uses list. A low aper 
level would retard the growth of a trainns 
effect. : 
Another possibility is that the kinds P 
responses evoked by the stimulus e 
of critical importance. The free associa sii 
stimulus materials evoke different бас 
responses on each presentation of a ae 
item. In contrast, entire phrases are eV? P 
by the unusual uses items, many of the =. 
ponents of which are common to à “ite | 
number of the phrases. This would have 1 
effect of reducing the uncommonness 0 aft 
phrases evoked during training, since ws 
of the component verbal responses oc 
repeatedly. 

In addition, it appears that a a 
limited number of different uses are а sto | 
able for each item. Given the instructio 4 | 
respond to each repetition of an item rent 
the inability of Ss to do so with а diffe! tly 
operant, they respond with essent? 
synonymous uses repeatedly. Tt is appa! a 
that considerably more experimenta 
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relat! giis 
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search is needed in order to adequately 
determine the training conditions and char- 
acteristics of stimulus materials that are 
necessary to facilitate originality in test 
situations. 


EXPERIMENT IV 


Condition X, in Exp. II indicated that 
originality training with unusual uses items 
does not affect performance on the free 
association test of originality. That is, the 
experimental procedure previously found 
to be effective, the evocation of different 
responses to the same free association 
stimuli materials, did not have a comparable 
effect when the stimuli were the names of 
common objects, and the responses were 
unusual uses for these objects. Another 
study, Exp. IIT, which varied the number 
of items and the number of repetitions 
of the items, likewise failed to produce 
a significant change in the uncommon- 
ness of responses on the free association 
test of originality. It was concluded 
that repeated presentations of the unusual 
uses items was ineffective because it did 
not produce a large enough increase in the 
uncommonness of the responses evoked 
during training. 

The failure to modify originality test per- 
formance by training with unusual uses 
leaves another important problem un- 
answered. If, as we have assumed, original- 
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ity may be learned by the same principles 
as other forms of operant behavior, it 
should be a function of the amount of train- 
ing administered. But since training with 
the unusual uses items failed to produce any 
significant effect on test performance, the 
latter hypothesis could not be adequately 
studied in the above experiments. 


The purpose of the present experiment, 
therefore, was to determine whether per- 
formance on tests of originality would vary 
as a function of the number of repetitions 
of the training list. Training was conducted 
in the free association situation, since re- 
liable effects upon originality have pre- 
viously been obtained under these condi- 
tions. However, in order to increase the 
generality of the results of this series of 
studies somewhat, new training and test 
words were employed as free association 
stimuli. 


Method 


Subjects. The Ss were 201 students drawn from 
introductory psychology classes. 

Stimulus materials. The stimulus words used in 
the initial training and free association test lists 
were drawn principally from Woodworth and 
Wells (1911), and were selected on the basis of 
their tendency to evoke opposites. This would 
have the effect of producing a high communality 
of responses to each stimulus word, i.e., relatively 
few different responses would be evoked. Table 12 
shows the free association training and test words 
employed. There were 24 words in the training 


. TABLE 12 
SriMULUSs Worps EMPLOYED IN THE FREE ASSOCIATION TRAINING AND Test Lists IN Exp. IV 
Training List Test List 
1. yes 13. dirty ? m 
2. good 14. wrong A : 
3. east 15. empty . above 
4. day 16. after 4. north 
5. love 17. deep 5. top 
6. soft 18. large 6. wet 
Т, far 19. tall 7. up 
& mooth 20. open 8. long 
= ly 21. come 9. big 
m ad 22. male 10. front 
еа! 
| 23. cold 
2 СЕ? 24. low 
2. 105 
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list and 10 in the test list. Guilford's (1950) 
Unusual Uses Test was administered aíter com- 
pletion of the free association test list. 

Procedure, The four principal groups were 
treated in identical fashion on the initial pre- 
sentation of the free association list, the first 24 
stimulus words for these groups. The usual free 
association instructions employed in earlier experi- 
ments of this series were administered. The 
standard control condition (C) received the test 
list of 10 new stimulus words following completion 
of the initial 24-word training list. Since the free 
association training and test words were selected 
because they gave a high proportion of opposite 
responses, it was considered possible that respond- 
ing to the first 24 stimulus words might induce a 
tendency to emit opposites on the free association 
test list. A second control group (Cx) was there- 
fore employed to determine whether this effect 
would occur. Ss in this group received only the 
free association test list, omitting the 24-word 
training list. 

Three different experimental groups received one, 
five, or ten repetitions of the training list prior to 
Presentation of the free association and Unusual 
Uses tests. The latter test was employed in a 
slightly modified form from the standard рго- 
cedure. The items were Presented individually 
rather than in the usual booklet form with a time 
limit for each of the two sections of the test. 
Under the latter conditions Ss tend to devote a 
disproportionate amount of time responding to the 
first item in each section, thereby giving relatively 
few responses to later items in the test. Presenting 
the items individually would tend to equate the 
number of responses given to each item. 


Results 


Since these training and test words had 
not been used in the previous experiments, 
response norms for the stimuli had to be 
constructed. The frequency with which the 
different responses occurred to each stimu- 
lus word in the training and test lists was 
therefore determined and Ss responses 
were then scored on the basis of the ob- 
tained frequencies, 


Table 13 shows the mean originality scores 
obtained by the various groups on the first 
presentation of the training list. Homo- 
geneity of variance was indicated by the 
Hartley test. But a simple analysis of vari- 
ance indicated that the groups differed 
significantly prior to the introduction of the 
experimental treatments (F = 4.81; df — 3, 
155; P « .005). The results of ¢ tests in- 
dicated that Group X; was significantly 
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TABLE 13 


MEAN Pretest Е ASSOCIATION ORIGINALITY 
Scores IN Exp. IV 


Cond. N Mean SD 
с 44 90.01 2 
35 42 93.68 lr 
Ke 39 73.14 33.54 
Ber 34 88.14 20.4 


superior to each of the other groups on the 
pretest (P « .05). 

Fortunately, the regression of the free 
association test scores on the pretest scores 
was considerable, . 87. Tests of hetero” 
geneity of regression and homogeneity к, 
variance were not significant, permitting. i 
analysis of covariance in which the od 
nality test scores were adjusted for id 
initial differences in the training list. Ta 
14 shows the mean unadjusted and adjust 
originality scores obtained by the variou 
groups on the free association test. d 
simple analysis of covariance was conduct? 
and an F of 2094 was obtained (df 8 
154; P < .001). The results of t br 
showed that Groups X; and X,, did а 
differ significantly. However, all other om 
ferences among the four groups were stat! 
tically significant (P < .01). | 1 

А t test was conducted between Сн 
С and Сх (N =42; M = 73.47; с = 36.9 Е 
in order to determine whether the emissis 
of opposites on the training words wo 
increase the tendency to give opposites 
the test list. If such were the case, Group; 
would have significantly higher test scor 
than Group Су, since opposite respons 


TABLE 14 


350° 
UNADJUSTED AND ADJUSTED (MEAN') FREE А 
CIATION TEST ORIGINALITY SCORES IN 


Exe. IV 
Cond Mean SD Mean 
0 
€ 82.88 28.91 79.69 
Xi 68.76 33.05 t 
Xs 36.03 28.12 put 
Xo 46.03 33.62 + 
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were the most frequently occurring re- 
sponses to the stimulus words. This was not 
the case (+= 1.32; df=84; P > 10). 

The frequency of unique responses on the 
Unusual Uses Test was determined for each 
5 on the basis of the norms for this test 
derived from Exp. I. A second score ob- 
tained for each S was his total number of 
nonunique or common responses. Since the 
score distributions for unique responses are 
skewed, and the variances heterogeneous, 
the transformation VX + .5 was applied to 
them. 

Table 15 shows the mean number of 
transformed unique responses obtained by 
the various groups receiving the Unusual 
Uses Test. Hartley’s test indicated that the 
variances of the transformed scores were 
homogeneous. Since, as in previous studies, 
there was no significant correlation between 
the number of unique or common responses 
and originality on the free association pre- 
test list, a simple analysis of variance rather 
than covariance was conducted. The results 
of the analysis were significant (Ё = 5.82; 
df =3, 155; P < .001). The results of ¢ 
tests indicated that the three experimental 
groups did not differ significantly from each 
other. Each of the three experimental 
groups gave significantly more unique re- 
sponses than Group C (P < .05 for Group 
X; vs. C, and P < .001 for the other two 
experimental groups vs. Group C). 

Table 16 shows the mean number of 
common responses obtained by the various 
groups on the Unusual Uses Test. The 
variances were homogencous as indicated 
by the Hartley test. A simple analysis 
of variance was conducted, yielding a sig- 
nificant result (F = 4.62; df = 3, 158; P < 


TABLE 15 


MEAN NuMBER OF TRANSFORMED (МХ + .5) 
UNIQUE Uses IN Exp. IV 


Cond. Mean SD 
1.01 .45 

x 1.43 54 
X; 1.24 #5 
1.43 -64 


Хь 
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MEAN NUMBER or COMMON Uses IN Exp. IV 


Cond. Mean SD 
С 19.48 5.42 
х; 21.19 5.06 
Xs 23.85 6.13 
Xo 20.94 4.57 


:01). The results of ¢ tests showed that 
Group X; gave significantly more common 
uses than each of the other two experi- 
mental groups (P < .05) as well as Group 
C (P < .01). No other differences among 
the groups were significant. 

Results from the free association test of 
originality indicate that each of the three 
experimental groups was more original than 
the control group. Our analysis, however, 
cannot indicate the precise nature of the 
trend produced by varying amounts of train- 
ing, and the experiment was not designed 
for this purpose, but only to determine 
whether or not some form of trend exists. 
The data obtained show that reliable changes 
in free association originality may be pro- 
duced by varying the amount of train- 
ing in the standard experimental procedure. 
The particular form of the trend, however, 
is no doubt a function of the length of the 
training list as well as the number of its 
repetitions. Thus, a factorial experiment 
must be conducted in which both the length 
of the training list and the number of 
repetitions are varied before it may be pos- 
sible to specify the function that obtains 
between amount of training and originality. 

Although all three experimental groups 
gave significantly more unique responses 
than Group C, the experimental conditions 
did not vary as a function of the number 
of repetitions received. The basis for the 
absence of a trend among the experimental 
conditions on the Unusual Uses Test as 
contrasted with the free association test 
cannot be determined, and one can only 
speculate at this point as to why this dis- 
crepancy was obtained. A possibility may 
be the lack of power of the simple analysis 
of variance as compared with covariance 
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and/or sampling error in the case of Group 
X, These same factors may also be re- 
sponsible for the significantly greater num- 
ber of common uses offered by Group X; 
as compared with the other conditions. 


Evidence that performance will vary 
systematically with the number of repeti- 
tions of the training list, even if on only 
one of the originality tests, lends some 
further support to the interpretation of 
originality as a form of learned behavior 
obeying the principles of operant condition- 
ing. 

These results further highlight the failure 
in Exp. ПТ to obtain any effect upon 
originality when unusual uses items were 
employed as the training materials. The 
fact that a single repetition of a 24-item 
training list produced a reliable increase in 
originality on both tests in the present ex- 
periment clearly indicates that the failure 
to obtain an effect in the earlier experiment 
was not due to an inadequate amount of 
training. 


EXPERIMENT V 


ТЕ, as we have assumed, originality can be 
learned according to the principles of in- 
strumental conditioning, then it should show 
some degree of persistence following train- 
ing. Since retention is one of the character- 
istics of learned behavior, the disposition to 
emit uncommon responses induced by the 
standard experimental training procedure 
should also show some degree of per- 
manence. The purpose of the present study 
was to determine whether this in fact is 
the case. 


Method 


Subjects. The Ss were 177 students drawn from 
introductory psychology classes. 

Stimulus materials. The lists of stimulus words 
used in the initial free association training and 
subsequent tests are the same as those employed 
in the first three studies, and were administered 
in the same order as in Exp. JI. As before, the 
Unusual Uses Test was administered after the free 
association test list. 

Procedure. АП Ss were presented the initial 
free association training and final test lists of 
stimulus words in the same fashion. Two control 
groups received only а single presentation of the 


TABLE 17 


MEAN Pretest FREE ASSOCIATION ORIGINALITY 
Scores IN Exp. V 


Cond. Ne Mean SD 
X. 62 81.76 22.95 
C 58 80.50 26.07 
X 27 71.46 21 “р 
С. 30 79.47 27. 


i К is ed 
reviously used. ore discard d к 
ееп data from these Ss as well as their unusual uses 


training list. Group C received the free ао: 
tion and Unusual Uses tests approximately Pu 
hour later. During the interval of time beter 
the training and test sessions, SS were engage ais 
work on a series of 10 paper and pencil m S 
Group С. was dismissed following the initia mth 
entation of the free association training list E 
instructions to return in 48 hours. The free ? а 
ciation and Unusual Uses tests were administe? 
at that time. Two experimental conditions, * ‘the 
X:, experienced comparable delays between E 
initial and final test sessions. However, 45 i andi 
vious experiments, they received five а ditio th 
presentations of the initial free association list V the 
instructions to give a different response than 
one used before to each stimulus word. 


Results 


HU 
Table 17 shows the mean originali 
scores obtained by the various group? : 
the initial presentation of the training that 
A simple analysis of variance indicate iof 
the groups did not differ significantly ре 
to the introduction of the experime? 
treatments (F = .67). Homogeneity , st. 
variance was indicated by the Hartley tit 

Table 18 shows the mean origin’ ne 


scores obtained by the various groups ga 


TABLE 18 


В sare А 
UNADJUSTED AND ADJUSTED (MAN) FREF 
CIATION TEST ORIGINALITY SCORES IN 


Exe. V 
Cond. Mean SD 
X 71.58 26.16 
È 84.47 24.57 
х, 77.12 23.13 
C: 87.05 26.05 
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free association test of originality. Appro- 
priate statistical tests indicated that the 
variances of the test scores were homo- 
geneous as well as the regression of test 
scores on training scores. 

Since the training and test scores were 
correlated significantly, a two-way analysis 
of covariance was conducted in which the 
originality test scores were adjusted for 
initial differences on the training list. 

An Е of 21.9 (df=1, 172; P < .001) 
was obtained for the originality training 
effects. An F of 5.58 (df=1, 172; P< 
.05) was obtained for the delay effect. 

The frequency with which different uses 
were offered for each item in the Unusual 
Uses Test was determined on the basis of 
the norms from Exp. I, and originality and 
common response scores obtained as in the 
previous studies. Since the distributions of 
scores for unique responses were skewed, 
and the variances were heterogeneous, the 


transformation VX +.5 was applied to 
them. 

Table 19 shows the mean number of 
transformed unique responses obtained by 
the four groups. Hartley’s test indicated 
that the variances of the transformed scores 
were still somewhat heterogeneous, Г = 
.05. As in previous studies, the measures of 
unique responses and common responses 
were not correlated significantly with origi- 
nality on the initial free association list, 
thereby precluding the use of the analysis 
of covariance. 

A two-way analysis of variance of the 
transformed unique responses yielded an Е 
of 15.47 (df=1, 172; P< 001) for the 
originality training effect. An insignificant 
F of 1.10 was obtained for the delay effect. 


TABLE 19 


UMBER OF TRANSFORMED (VX +.5) 


Mean N 
UNIQUE Uses IN Exp. У 


Cond. Mean SD 
6+ 

X 1.42 64 
© 1.06 35 
Ф 1.49 -53 
©; 1.21 E 


TABLE 20 


MEAN NUMBER OF COMMON Uses IN Exp, V 


Cond. Mean SD 
x 18.06 4.84 
c 18.15 5.24 
Xs 17.74 537 
C: 15.67 3.97 


Table 20 shows the mean number of com- 
mon responses obtained by the four groups 
on the Unusual Uses Test. The variances 
were homogeneous. An F of .77 was obtained 
for the originality training effect, and 3.32 
for the delay effect (df — 1, 172, in both 
cases). Neither effect was statistically sig- 
nificant. The results from the two different 
tests of originality indicate that highly re- 
liable originality training effects do persist 
for some time. Thus, some additional sup- 
port is gained for the interpretation of the 
training effects as a form of learning. Re- 
sults from the free association originality 
test also show a significant delay effect or 
decrement in originality as the time between 
the initial and final tests is increased from 
one hour to two days. This, or course, is 
another common characteristic of learned 
behavior. 

Results from the Unusual Uses Test 
showed a highly reliable training effect but 
not a delay effect. There was no significant 
decrease in the number of unique responses 
when the delay was increased from one hour 
to two days. Again, this discrepancy in the 
results obtained from the two originality 
tests may be due to a variety of reasons. 
Tt cannot be concluded, however, that а 
reliable decrement in unique uses did not 
occur within the time interval studied. 
Comparison of the delay conditions em- 
ployed in this experiment with the standard 
experimental and control conditions of Exp. 
IT, which received the free association lists 
in the same order, suggests that a decrement 
occurred within the first hour of delay. 

The extent to which the effects of origi- 
nality training persist, or are retained, is 
undoubtedly a. function of many different 
variables of the sort that have been studied 
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in classical experiments on verbal retention. 
Such variables as amount of practice or 
original learning, a wide range of delay 
intervals, activity during the delays, and 
distribution of practice trials, may all in- 
fluence the extent to which the effects of 
originality training persists. 

As data accumulate from parametric 
studies of this sort, and they show that per- 
formance on tests of originality vary in a 
fashion similar to performance in more 
traditional learning situations, to that extent 
it may be assumed that the effects of origi- 
nality training do not differ fundamentally 
from other forms of learning. The study 
reported here is an initial attempt in that 
direction, and lends some support to the 
hypothesis that originality can be learned 
in the same fashion as other forms о! 
operant behavior. 


CONCLUSIONS 


Each of the four experiments employing 
the standard experimental training proced- 
ure with free association materials produced 
a highly reliable increase in the uncommon- 
ness of responses on two different tests. 
The first two experiments suggest that the 
standard experimental training procedure of 
repeatedly evoking different responses to the 
same stimuli is the most successful of the 
three experimental procedures employed. It 
produced significantly greater facilitation of 
originality on one or both originality tests 
as compared with the method of evoking 
different responses by presenting different 
stimuli as in Exp. I, or evoking uncommon 
responses as textual responses as in Exp. П. 
Possible variables responsible for the 
efficacy of the standard training procedure 
have already been indicated earlier in this 
paper and elsewhere (Maltzman, 1960). 


The results of Exp. IV and V lend some 
support to the hypothesis that originality 15 
learned behavior and varies as а function 
of the same antecedent conditions as other 
forms of operant behavior. Effects of orig! 
nality training may persist for as long 2$ 
two days, under the training and test con- 
ditions employed in Exp. У. Originality, at 
least, on the free association test, also varies 
as a function of the number of repetitions 
of the training list. 

Although the standard training proce 
dure has proved to be effective, there are 
many possible variations of the procedure 
that remain to be investigated. For example, 
instead of completing the entire training list 
of stimulus words before re-presenting m 
successive responses to each stimulus mig? 
be evoked. Various combinations of these 
procedures may also be studied. Further 
experimental studies are also needed €f 
ploying different types of test criteria, m 
cluding the more traditional kinds 0 
problem situations. Studies of this kind as 
currently in progress in the UCLA labora 
tory. Work of this sort may indicate t 
extent to which the kind of training pes 
cedure studied here might have practic? 
applicability. In this connection, it sho А 
be noted that the behavior measured on Ет 
originality tests employed in these expe. 
ments and similar tests correlates reliably 
with ratings of original behavior (Dreve@"™ 
1956). к 

In conclusion, it should be noted that E 
term "originality" as employed in this рар 
refers to a particular kind of behav! 


measured by specified operations UP his 


given conditions, If it is objected that b 


is not genuine originality, then it shou 
incumbent upon the critic to specify 
latter behavior in equally operational te,” 
so that it too may be subjected to exP® 

mental study. 
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APPENDIX 


THE CLASSIFICATION OF RESPONSES ON THE UNUSUAL USES TEST 


" I 
The list of common and unique uses that follows was initially obtained from the results of Exp. А 
and was used with minor modifications in the scoring of the Unusual Uses Test from the later exper 


ments. 


distribution for all items was designated as unique. 
Adjustments were made in the initial list by the addition of unique u 
On the other hand a unique use was recl 
Few adjustments of this sort were necessary. 


common. 


in the test of a later experiment. 
occurred more than once on any given subsequent test. 


1. AN AUTOMOBILE TIRE (used on the wheel of an automobile) 


float 

roll 

swing 

bumper 

toy 

smoke 

chair 

obstacle course 

jump on 

scrap rubber 

flower bed (planter) 
weight 

mending (patch) rubber 
weapon (hit someone with, etc.) 


M (garden, gas station, 
etc 


shoe soles 
target 
advertisement 
costume 


prop up bigger object 

holding hub cap 

protect car from bumping paint 

protect axle 

lawn mower wheels 

toss bean bags through 

seat at ball game 

have on fishing boat 

cradle 

base for lamp 

cover 

fort 

door mats 

anchor down poles 

durability demonstration 

cable holder 

battering ram 

padding on walls of amusement 
park 


Соттоп Uses 


ride on 

inner tube (for swimming) 
train dogs 

melt and reshape 
boat dock (bumper) 
wheel on cart 
slingshots 

start fire (burn) 

put in garage 
blockade 

football practices 
insulation 

rubber bands 


throw things through ( football, 
etc.) 


sled 

hoop 

drum head 

supporting another object 

sell for living (retail product) 


Unique Uses 


household rubber material 
use in musical chairs 

keep rug from slipping 
hold air 

test coordination 

skin diver's net 

hold letters or bundles 
seat at beach 

press papers 

pillow 

line doors or furniture 
store refuse in 

bumper cars at beach 

bed for animal 

hold door open 

painter’s still life 

hold back loose dirt 

keep tire salesmen in business 
costume for strippers 


А he 
In the initial classification based on the results of Exp. I any use occurring only once іп p 
Uses occurring more than once were classified ai 


s when they occurred 
ed as common if it 


make smell 

make sandals 

make money 

life saving 

to swim with 
binding to hold tree 
horse equipment in horse show 
play catch with 
draw circles 

buoy 

tunnel 


hold things in lining (hiding 
place, paper, water, CC 


to throw 

lo cut up 

make smog 
sand box 

to break things 
hide narcotics 
shock absorber 


picture frame 

run in to build up legs 
soap box racer brake 
plug a hole 

in a skit 

make wall 


base for bell- ringing pole 
carnival 


trade it 

inner lining for rubber gun 

hang things on 

wedge 

carve on 

make rubber piece for tr 
(fishing) 

keep repair men busy 

roof for stork nest 


olin 


souvenir 
ballast in a boat 


э 
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in relay games 

lift to build muscles 

trap 

rubber strings for tying wire 
hang on tree 

clutter up backyard 

Гог color scheme 

push things down driveway 
make tracks in sand 

use in circus act 

fill with cement for weight 
mold 

capture or hold someone 
build pyramid to jump off 
draw on with chalk 

build tree house 

sunbaths 

texture pieces in art projects 
loan to your neighbor 

put in a continental kit 

hold back flood waters 


build with 
2. A KEY (used to open a lock) 


lever (pry things open) 

weight 

decoration (on chair, etc.) 

clean something (fingernail, 
crack, etc.) 

toy 

Scratch (ear, etc.) 

screwdriver 

cutter 

noise 

charm 

melt for scrap metal 

start car 

pry open (jar, can, door, shoe 
polish, window, etc.) 

key to city | 

scratch (message, desk, wood, 
paint, etc.) 

weapon — 

symbol (religious, advertise- 
ment, etc.) 

necklace 

bracelet 


add to junk box 
Dlier 

chrome plating 
electric shock 
having secret 

Point out something 


throw knives 

makeshift table 

gift 

slice and make holders 

transfer air to bicycle 

sit on while being pulled by 
speedboat 

wagon 

to step on when there’s mud 

to hide behind 

chewing gum (Spain) 

stick pins in 

steal 

wheel barrow 

in a clock 

adjust temperature 

оп а safe 

rubber coverings 


support walnut trays above 
ground 


in zoo for monkey to hang on 
around a horse in a bull fight 


Common Uses 


have (open) diary 

put on key ring or chain 
give to someone 

make new keys 

fish lure 

start mechanical tov 
bookmark 

hobby 

token 

wedge 

jingle (to sound like money) 
secret identification (password) 
rattle 

“index” for map, etc. 
dig (make) hole 
jewelry 

hypnotize 

tap code 

wind clock 

short circuit 

sorority badge (pin) 
tighten skate 


Unique Uses 


scrape rust on car chrome 

ornament on Christmas tree 

symbolize gift of car 

signify two people going to- 
gether 

button 


joke 

make mudpies in middle of 
play tag with 

tear up and use as brake on toys 
dogs to play with 

build fence 


end of hammer to pound out 
dents 


keep wood raft afloat 
tractors 

flatten something 

little boy to play army in 
burn to prevent frost damage 
store unused garden hose 
ash tray 

bird nest 

make an invention 

pile up outside of dealers 
cause accidents 


protective lining for mechanical 
clamp 


bottom of table legs to prevent 
harming floor 


make blueprints 

use in hop-scotch 

look important 

to throw 

tool 

drawing outline 

carving 

hammer with 

pulley for window shade 
conductor 

treasure in treasure hunt 
make designs in sand 
magnet 

gift 

poke hole (in cloth, wood, etc.) 
in game of hide and seek 
paperweight 

clue 

wear 

souvenir (to hang on wall) 
diving and retrieving 


hoop pick (for horses) 
shoot in slingshot 

tap rhythmically 

fuse 

impress people 
studying designs 


20 І. MALTZMAN, S. SIMON, D. RASKIN, лхр L. LICHT 


straighten hairpins 
lift small objects 
give to baby to chew on 
chew on if anxious 
to write with 

door knocker 

award 

unfasten screen 
school remembrance 
scrape 

to press 

paint it gold 

turn on furnace 
thermometer on wall 
engrave in tree 
make print 

pad a lock 

wrap string around 


put on ribbon and tie around 


neck 
to photograph 
cut teeth with 
press down bump 
design on pictures 
weight for small animals 
impression in clay 
evidence in trials 
training dogs 
toothpick 
brace 
break something 
tap on windows or doors 
place marker 
play catch with 
fiddle with 
honorary club 


3. A SAFETY PIN (used for fastening) 


prick skin 

punch hole 

open lock 
decoration 

scratch 

weapon 

fish lure 

remove splinter 
pin 

toy 

toothpick 

sticking to bulletin board 
put things on map 
stick someone 
make chains 
untie knots 
puncture blister 
break balloons 
remove sliver from skin 
decorate sweaters 
hold diaper 
weights 

clean fingernails 
open glue tube 


connect as belt 

base for decorative pin 
measure acoustics 
make doorbells stick 
to keep in pocket 
repair hems 

scrap metal 

cork opener 


Common Uses 


punch holes in (cut) paper 

carve words 

open objects 

clean small objects 

insert elastic (in sleeve, belt, 
object) 

necklace 

pry things open 

hold other pins 

sewing 

tack (something to wall) 

picture hanger 

marker for hopscotch 

jewelry 

puncture tire 

"pick" (poke) at something 

hold dress (skirt) together 

hold papers together 

pull cord through opening 

use as thumb tack 


clean small ledges (corners, 
etc.) 


repair zipper 
key ring 
stick something 


Unique Uses 


make link in S. P. bracelet 
demonstrate magnate 
make fine lines in drawing 
hook for lanyard 
squeezer 

decoration for food 

string tokens on 

hold glasses together 


autique 

jangle to attract attention 
reflect sunlight 

suck on 

put in valentine 

put in scrap book for memory 
object for conversation 

put safety pin in hole and save 
decorate plastic panels 

hat 

melt into buttons 

handle on box 

melt and make ring 

loosen something 

twirl 

pick ear with 

arouse curiosity 


retail product 

put on button with 

pin paper to wall 

decoration for hats or 
(clothes) 

kill insect (bug) 

get dirt out of ring 

engraving (carving) 

make wound (sore) : 

mark stitches in knitting 

make wire (scrap) 

hang up skirt 

hang things on wall 

patterns or designs 

clothes pin 

screwdriver 

repair something 

scratch name in metal 

indicate markers 

pick things apart 

scratch furniture (or © 
objects) 

Stick pin in cushion 

hold buttons 

letter opener 


dresses 


ther 


prop 
poke things that ar 
places 


string pearls 

pull string through narrow 
opening 

hold message on door 

keep window shade up 


e wedge 


d jn 
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mark spot on treasure hunt 

wire joiner 

pierced earrings 

prick babies 

to impair clothes washer 
mechanism 

hold up falling objects 


determine who has next baby at 
shower 


pinpoint object 

cause pants to fall down 
poison dart 

carry in purse 

get money out of piggy bank 
clean top of salt shaker 

bend into party favor 

widen small hole 


play "pin the tail on the 
donkey" 

phonograph needle 

puncture leather 

collar clip 

pick off fingernail polish 

bracelet 

Christmas tree decoration 

belt 

pinch somebody 

bend around something 

use as nail 

torture flies 

stick in a flower 

make small noise 

running elastic 


4. A WATCH (used for telling time) 


decoration 

toy 

demonstrate mechanics 
(paper) weight 

tell direction 

hear (listen to tick) 
pawn (sell) 

prize 

hypnotize 

rhythm 

bracelet 

Гог experiments 
impress people 
draw circles 

timing events 

take apart and fix 
jewelry 

gold content 

picture holder 


used by watchmakers (to make 
living from) 


cover scars on wrist 


melt parts for metal 

money 

learn clockwise vs. counter- 
clockwise 

make dent in arm 

reading meters 

glass for cutting 

irritate person 

silver content 

reassemble gears for other 
purpose 

provide employment 


Common Uses 


suspend fobs on 

gift 

art pieces 

tell date 

time pulse 

to wear 

wealth object 
showmanship (show off) 
learn to be jeweler 

take apart and study parts 
test hearing 

time tests 

teach how to tell time 
time bomb (prank) 

bribe object 

to start conversation with 
to keep a date (appointment) 
timing device for bombs 
use as pin 

set off alarm 

put cheat notes in 

teach to count (numbers) 


Unique Uses 


security for loan 

fiddle with when nervous 

hang on string and find vertical 

use chain to latch door 

use parts—paraffin for wrap- 
ping paper 

speedometers 

to steal 

glass for magnifying 

check resistance to shock 

exercise fingers 

test sight 


clean pipe 

on back of sign 

clean comb 

make scatter pin 
make up weight 
terms of barter 
make cardboard filter 
reshape 

tear material 

remove thread 
stopper holes 
mechanical purposes 
separate stitches in knitting 
dig hole 

stick butterfly on it 
play poker with 


check resistance to water 
noise to keep someone awake 
advertising 

throw at someone 
metronome 

to sell 

tell weather 

(hang from) chain 

ticking sound to put to sleep 
smuggle (conceal) (jewels) 
collect as hobby 

family heirloom 

souvenir 

to listen to 

to annoy somebody 

strap things together 
ticking to soothe puppy 
noise from ticking 

comfort a (motherless) cat 
looks 


loan to magician who pretends 
(tricks) to break it. 


home appliance 
guessing lengths of time 
mechanics to fool with 
lure to catch thief 
swing in self-defense 
knuckle guards in fight 
to expand industry 
forecast future 

symbol 

clean fingernail with stem 
training item in school 
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projectile for throwing 
investment 

window displays 
reward 


split apart for modern abstrac- 
tion 


mantelpiece 

tinkering 

wrist warmer 

source of frustration when not 
running 

starting signal 

pounding something 

vent destructive instincts on it 

back as mirror 

hold something on wrist 

get spare parts 

open something 

teach someone how to make one 

weight measure 


parts to decorate fish bowl 

squash ants 

sound effect in movie 

keeps jewelers happy 

hide birthmark 

threatening someone 

practice in amplifying sound 

turn on lights 

turn on pump 

hide stolen article 

puzzle 

remembrance 

give seasons 

symmetrically suspended center 
piece 

chocolate candy formed as 
watch 

sound vibration experiment 

used to tell when you are near 
magnet 


5. A BUTTON (used to fasten things) 


decoration (shoes, handbags, 
etc.) 


toy (play game) 

play money 

draw circle 

necklace 

earrings 

poker chips (gambling) 
means of exchange 
make objects (picture, etc.) 
Stop up sink 

hobby 

teach how to count 
practice sewing 

to throw (at people) 


pry open cans, jars, etc, 

to roll 

souvenirs 

paperweight 

color board 

for beauty or looks 

swing from swing to scare flies 
pressure object to stop bleeding 
golf marker 

article in visual demonstration 
map marker 

to paint 

stuff chair 

pin 

construct pendulum 


Common Uses 


lucky pieces 

badge (emblem) 

checkers 

bingo 

make rattles 

noses for dolls, etc. 

make designs (patterns, etc.) 

sell for money 

make eyes (on dolls, on animals, 
snowmen, etc.) 

learn colors from 

slug 

small target 

to suck (chew) on 

monocle 


Unique Uses 


to flip as coin 

put texture on paintings 

use up thread 

demonstrate durability 

signal 

teething for babies 

weight 

hold name tag 

stop swelling of bump 

Christmas tree decoration 

Separate papers (wedge, etc.) 

prize 

historical value 

E handicapped children self- 
help 


break and use parts for confetti 

practice putting things together 

reassure baby chicks 

weight on tie to keep it in place 

guard a vault 

hang medal from 

wear around neck to hold collar 
in place 

play passing game т 

decorate with parts in plastic 
panels 

take out springs and bounce 
them around 

buy bands for 

spend money fixing 

in place of grandfather clock 

take hands off and use in 
another watch 1 

drop from high distance to tes 
breakage 


trade object 
bracelet 


teach children to fa 
(dress themselves) 


make noise (with many) 
play with when nervous 
make jewelry 

fill button (sewing) box 
"ivy league look" 

hide tear or spot (on dress) 
hide a hole (in wall, etc.) 
in counters 


put string through it (bends, 
etc.) 


hold something together а 
advertise for office in elect! 


sten things 


: oles 
sift bread crumbs through h 


stuff dolls 

make faces with 

shimmy 

stuff bean bag 

mark of crosswalk 

word in word assimilation 

tests 

memories of clothes 

Гог art projects wher’ ] 

hang from string to 566 У 
wind blows 

outline pictures 

clue 


swallow to get sick 


Е; 
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put in fish pond 
evidence in a trial 


in educational books for 
children 


trip someone 

build houses 

end of string attached to light 
play catch with 

knob on radio 

token of appreciation 

put in muffins for raisins 


fingernails on snowman 
remembrance | 

put in goulash for joke 
throw in shallow pond for color 
outlet for creative design 
nervous fingers to fidget with 
fishing plug 

wheels of a car 

show authority 

to trip on 

designate political party 


6. EYEGLASSES (used to improve vision) 


change appearance (looks) 

protect eyes (from glares, 
sparks, etc.) 

magnify 

disguise (hide behind, etc.) 

acting (plays) 

cutting 

keep out of fights 

toy 

decoration 

hearing aids 

keep out sun 

look intelligent (professional, 
etc.) 

for reading (studying, etc.) 

start fires 

prop 


shield metal particles 
pane for peephole 
doll house window 
bottle caps 

put pencil in 

tap to be annoying 
give self-confidence 
as fad 

test vision 

look at atom test 
crush glass and use as dog 


poison р 
spend parents’ money 


decorate with shells 
attract attention 


Common Uses 


masquerade costume 
comedy 


when you want to break some- 
thing (release emotions) 


make eyes look larger (or 
smaller) 

put into a case 

sophistication (look distin- 
guished) 

look distinctive or old 

telescope (binoculars, etc.) 

microscope 

mirror 

gesture object 

worsen vision 

give false impression (imper- 
sonations) 

hurt looks 


Unique Uses 


learn to use radius of curvature 
give people complex 

scare away men 

give to charity 

sun reflector to send signals 
hold false nose on 

frame for spare wire 

add color 

spend money foolishly 

hang on statues 

to finger when nervous 

excuse for not obeying signs 
design frame for 

construct them 

melt down glass for better use 


button box for invalids 
make clip for hair 

make knob on lid 

eyes in gingerbread cookies 
pledge girlfriend with 

hold the car door in place 


causes embarrassment when 
they fall off 

toy soldiers 

put on string for cat to play 
with 

make pools in a lake 


place on head to hold hair down 


protect eyes from smog (dust, 
etc.) 


relieve headaches 
fashion mode 

art object 

“gag gift” 

hide black eye 

rest eyes 

lenses for experiment 
paperweight 

weapon 

jewelry 


chew on ends (during exams, 
etc.) 


impress people 
for amusement 
carry in purse 


scratch head with rims 

draw circle 

melt and analyze composition 
rinse the eyes 

replace human eye 

book mark 

conversation piece 


reach object that has fallen in 
narrow hole 


advertising 

cut oneself 

make living from 

depict certain personality 
to break and cost money 
hide hangover 
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PERSONALITY FUNCTIONS OF 
SYMBOLIC SEXUAL AROUSAL TO MUSIC? 


MICHAEL A. WALLACH? AN» CAROL GREENBERG 
Harvard University and the Age Center of New England, Inc. 


ow does music psychologically affect 
H the listener? In most of the research 
to date done with the aim of answering this 
question, evaluation of psychological effects 
has depended on overt judgments by the 
listener concerning the music's emotional 
meaning for him. This tradition of studies 
has been recently summarized by Farns- 
worth (1958), and work by Sopchak (1955, 
1957) provides a current example of the 
technique. Its use seems based on the his- 
torical linkage of experimental aesthetics 
with the introspective method, and would 
appear to neglect our knowledge of the in- 
accuracy often involved in conscious reports 
of motivational states. 

Freud (1938a) has suggested that art in 
general can serve as a way of sublimating 
sexual energy, thereby channeling it away 
from its more customary social expression. 
Music, in these terms, could function as an 
indirect, compensatory means of expressing 
sexual arousal. Sublimation's relevance to 
music is well expressed in Erikson's (1958) 
description of Martin Luther: 


Most of all, this kind of person must shy away 
from intimacy. Any physical closeness, with either 


! This study was supported in part by a research 
grant (M-2269) to the senior author from the 
National Institute of Mental Health, United States 
Public Health Service, conducted under the auspices 
of the Age Center of New England, Inc.; and in 
part by а grant from the Laboratory of Social 
Relations, Harvard University. Grateful thanks 
are due the members of the senior author's Har- 
vard University course and seminar on the psy- 
chological foundations of aesthetics, for aid with 
various aspects of the study ; and Donald S. Dunbar 
of Simmons College, for making classes available. 
Lise Wallach made invaluable critical. comments 
on the manuscript, and drafts were also read by 
Albert J. Caron, Ellen Greenberger, and Nathan 


Kogan. | 
? Now at Massachusetts Institute of Technology. 


sex, arouses at the same time both an impulse to 
merge with the other person айа a fear of losing 
autonomy and individuation. . . . The contrast be- 
tween the exalted sexual fusion of his autoerotic 
dreams and the complete sense of isolation in the 
presence of the other sex is catastrophic. . . . 
Musie, at such times, can be a very important 
means of socializing and yet of communing with 
one's emotions: and Martin was a master in using 
music as a bridge to others, and also as a means of 
creating distance (pp. 101-102). 

So also, Sullivan (1949) repeatedly illus- 
trates how Beethoven’s emotional involve- 
ment with music performed a displacement 
function to compensate for his sexual with- 
drawal from other people. . 

Insofar as the motivational functions of 
music may carry sexual meanings, we can 
hope to understand them only through ex- 
ploration at a more unconscious level, for 
two basic reasons. Even with the fullest 
intention of being candid, a person may be 
unable to report veridically on his sexual 
feelings because of their deep-lying connec- 
tions with early fears. Secondly, societal 
conventions are such as strongly to inhibit 
such reporting even if a person actually is 
aware of these feelings. 

Erikson's implied analogy between the 
psychological functions of dreams and those 
of music suggests, further, the following 
way of assessing music's unconscious moti- 
vational effects. The kinds of symbolism 
studied by Freud (1938b), in order to com- 
prehend the latent sexual significance of 
dreams, may also help us to evaluate the 
possible sexual import of music. Toward 
this end, the study to be reported used vari- 
ations of techniques developed by Clark 
(1955) and Beardslee and Fogelson (1958) 
as its point of departure. In brief, a listener 
was asked to let himself get into the mood 
of the music, and was told that after hear- 


ing the selection he would be requested to 
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write a four-minute story which the music 
had suggested to him. Content analysis of 
the story for activities and things presum- 
ably symbolic of sexual arousal (in terms 
of Freudian assumptions) then permitted 
an evaluation of that selection's implicit 
sexual motivation effect for the listener in 
question. 

The aim of the present research was to 
determine personality dimensions respon- 
sible for the effects music might exercise on 
symbolically expressed sexual motivation. 
The experiment's starting point was a psy- 
chological theory of aesthetics presented 
elsewhere (Wallach, 1959), which calls for 
a functional analysis of music—a study of 
what music accomplishes for the organism 
in terms of its needs. Our earlier consid- 
erations have suggested that music can 
function as a channel for the displacement 
of sexual motivation inhibited from more 
direct expression. What are the conditions 
of personality under which music will serve 
this function for the listener? 

Clinical discussions of the reasons for 
displacement of sexual drive to fantasy 
levels (for example, Fenichel, 1945) suggest 
that such indirect fulfillment becomes neces- 
sary when, as in the case of Erikson's de- 
scription of Luther, the individual feels a 
need for other people, but is afraid of them 
—by virtue of previous punishments or 
their threat. He therefore avoids closeness 
with others, but the need is sufficiently 
strong that his defense of social isolation 
cannot really work. The sign that such a 
person's defense of avowed introversion 
doesn't function successfully is the presence 
of a high level of anxiety. 

Consider, on the other hand, someone 
who likewise professes a desire to be alone, 
but does not in addition manifest a high 
degree of anxiety. Such a lack of anxiety 
would seem to imply that this person's atti- 
tude of avoiding others is a much more 
stable adaptation. This individual, we would 
expect, hence would not need to use music 
as a source of substitute satisfaction in the 
manner just described for the introvert who 
is high in anxiety level. 


Finally, we might well expect a person 
who avows a high degree of concern with 


other people—a social extravert—also to re- 
flect this involvement to some extent on the 
fantasy level because of that involvement's 
very intensity. Such fantasy expression 
should be weaker than that energized by а 
displacement function, however, since in the 
latter case fantasy expression is the only 
channel available to the individual, while a 
social extravert is also able to express him- 
self more directly. Since introversion seems 
to be a more “defensive” and extraversion 
a more “normal” orientation, anxiety more 
clearly seems to be an index of the success 
or failure of a defense in the case of intro" 
version than in the case of extraversion. 

More fantasy expression of sexual nee s 
should occur for a social introvert, then, ! 
the person has a high anxiety level, thus 
betraying a lack of success for his defense 
of social isolation. The result shoul 
fantasy compensation. Рог introverts whose 
anxiety level is low, on the other hand, the 
apparent success of the introversion defense 
precludes the need for their engaging E 
fantasy displacement. Since social extra 
verts, finally, should show some symbol 
sexual arousal simply as a reflection of the! 
involvement with others, we would expe 
them to fall between the two intro 
groups in level of symbolic sexual arois 

In sum, the theoretical proposals we hav 
been advancing are these: 

1. One of the functions of listening, {0 
music is to permit the symbolic expressio 
of sexual impulses. A 

2. The strongest impetus toward such e 
pression arises for individuals who ae 
that they want nothing to do with other" 
but whose attitude of social introversio? 13 
unsuccessful as a mode of defense, as 1” 
cated by the presence of a high leve! "i 
anxiety. Since direct expression of sexu? 
impulses is thereby prevented but the лел, 
for others still remains, the only avail? 
channel is one of displacement. 

3. A weaker impetus toward such be 
pression arises for individuals who desc g 
themselves as socially extraverted. | s 
direct expression of sexual impulses 15, 
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rather than of displacement. Not being the 
only channel of sexual expression, the in- 
tensity of symbolic arousal should be less 
than in the first case. 

4. The weakest impetus toward such ex- 
pression is present for individuals whose 
expressed defense of social introversion is 
successful, as indicated by the absence of 
anxiety, since its success implies that the 
need for other people has been conquered. 
Hence, a minimum of symbolic expression 
is required. 

The following hypotheses may be derived 
from the above propositions, and constitute 
the experimental questions of the present 
study : 

1. Among those high in admitted anxiety 
level, subjects calling themselves socially 
introverted will show a greater degree of 
symbolic sexual arousal to music than sub- 
jects calling themselves socially extraverted. 

2. Among those low in admitted anxiety 
level, subjects calling themselves socially 
introverted will show a lesser degree of 
symbolic sexual arousal to music than sub- 
jects calling themselves socially extraverted. 


METHOD 


Selection of Music and Independent 
Judgments of Arousal Value 
ayed to a Harvard- 


Three jazz selections were p! 
Radcliffe class of 10 male and 13 female under- 
graduates who were asked to evaluate them in 
terms of sexual arousal. After hearing each piece, 
they were to rate it on a seven-point absolute scale 
of arousal value ranging from “most arousing” at 
one end, to “least arousing” at the other. A de- 
rivation of judges’ rank orderings was made from 
the position of each of the three pieces on this 
scale. Rank order rather than absolute scores were 
used in order to control for differences among 
judges in the position of all three pieces in absolute 
arousal level. 
The significance of the mean difference E 
ranks given each pair of pieces Was — € 
for each Sex separately and both combine 8 e 
being an individual's rank 


i es 
measure in all cas e г \ 
difference for two selections. On the basis of this 


analysis reported in detail below, it was found that 


one of the three pieces was judged lower in sexual 


arousal value than the others, 
were not distinguishable in tha 
of presentation of these pieces 
ingly, the piece that was lower 


value being played first. The aim of this order was 
to preclude any possible halo effect arising from 
the transfer of a high arousal piece’s imagery to a 
subsequently heard low arousal selection.? 

The selections heard, in their order of presenta- 
tion, were Eddie Burt's "Falling in Love," Dizzy 
Gillespie's “Doodlin’” (these two pieces each fad- 
ing out right before the restatement of the theme), 
and Sam Taylor's "Sam's Blues." The lengths of 
these selections were 185, 147, and 146 seconds, re- 
spectively, and they were played from a high 
fidelity tape.* 


Subjects 


Y A total of 98 undergraduate women participated 
in the story session part of the study—students in 
introductory psychology classes at Simmons Col- 
lege. The sample was restricted to women because 
Beardslee and Fogelson's work indicated women to 
be more open to symbolic sexual arousal by music 
than men, a result whose meaning can be better 
understood in the light of our own findings below. 
For two reasons, however, the actual sample con- 
sidered in data analyses was always smaller than 
98. First of all, three class sessions were required 
to gather the data. Since these sessions were sepa- 
rated by intervals of about a month, different 
absences occurred in each, thus reducing the effec- 
tive sample size to those who were present during 
all three hours for purposes of certain analyses, or 
during two of the three hours for purposes of 
others. A second cause of reduced sample size 
was the need to equalize cell membership in the 
two-way analyses of variance reported later. This 
required random elimination of cases from the 


oversize cells. 


Story Session 


During the first class session, the subjects (Ss) 
wrote stories in response to the musical selections. 
At the outset they heard the following instructions : 

This is a test of creative imagination. It gives 
you an opportunity to use your imagination, to 
show how you can create things as suggested to 
you by music that you hear. You are going to 
hear several musical selections. Try to let the 
mood of each one come over you completely— 
just feel whatever the music makes you feel; 
get completely involved in it—lose yourself com- 
pletely in the music. 

In a minute I will play the first selection. 
Remember, just lose yourself completely in the 
music—let its mood really come over you. After 
you hear the music you will be asked to write a 
story suggested to you by this music. The story 


3 The pieces were also in this order when heard 
by the independent judges. 

4 Their respective record numbers are Jazztone 
1223, Verve MG-V-8242, and MGM E-293. 
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can be of any kind, but it should be suggested 
to you by the music you have just heard. You 
may write notes during the music if you want 
to, but don't begin to write the story itself until 
the music is over. Remember, this is a test of 
creative imagination. We will play the first 
selection now; try to lose yourself completely in 
it When it is over you will start to write the 
first story. 

After hearing the piece, the following was read: 

Now you have four minutes in which to write 

a story suggested to you by the music that you 

have just heard. Please write it on the first page 

of your booklet. 

Instructions concerning the second and third 
selections were abridged but otherwise similar 
forms of the passages just presented. Each story 
was written on a separate page of the test booklet, 
S having a blank page in front of her while 
listening to a piece. The stories obtained were 
scored for symbolic sexual arousal in а manner 
described below. 


Preference Session 


А second class administration occurred about а 
month later. Its purpose was to obtain a judgment 
of absolute level of preference for each of the 
pieces which had been heard before. Absolute 
preference levels were needed in order to control 
for the possibility that relations of personality to 
arousal would occur only for people who generally 
liked music of this kind. 

The Ss received a sheet with three Seven-point 
scales labeled “A,” “В,” and "C," respectively; the 
left end of each scale having "Strongly Dislike" 


printed above it, and the right end, "Strongly 
Like" This preference session was introduced as 
follows : 


І am going to play several musical selections 
which you have already heard. After you listen 
to each one, I want you to tell me how much 
you like or dislike it, by circling one of the 
numbers along the row. If you dislike the piece 
very much, you would circle 1; if you like it 
very much, you would circle 7; if you were in- 
different to it, you would circle 4. And so on 
for the other numbers: if you liked it a little, you 
would circle 5; if you liked it quite a bit, circle 
6; and if you liked it very much, circle 7. Оп 
the other hand, if you disliked it a little, circle 3; 
if you disliked it quite a bit, circle 2; and if you 
disliked it very much, circle 1. 

The first musical selection will be called “A,” 
so you circle a number on the first line. The 
second piece is “B,” so you mark the second line 
for that piece; and, likewise, the third piece is 
“С” 

Don't feel that you should express any particu- 
lar kinds of likes or dislikes, since there obvi- 
ously are no right or wrong answers. Some of 
you may tend to like all the pieces; some of you 
may tend to dislike all of them; still others of 


you may like one piece more, dislike another one, 

and so on. Just be sure that each of your 

markings expresses your true feeling about the 
piece you just heard. 
Now you will hear Selection A. 

After listening to this selection, S was asked to 
mark how she felt about Piece A. This process 
was then repeated for the second and third pieces. 
The same order of presentation for the pieces WaS 
used here as in the story session. 


Personality Assessment Session 


: У s- 
The hypotheses stated earlier required us ec 


sess two dimensions of personality: the indivi E 
level of admitted or manifest anxiety and her pros 
fessed attitude of social extraversion or ілігоүег. 
sion. Examination оѓ Eysenck's Maudsley S 
ality Inventory (Eysenck, 1956; Jensen, 19 о 
indicated that it provided highly suitable ver 
measuring each. The test contains two scales M 
in the present study. The Neuroticism scale, € 
taining such items as "Are you inclined (0 the 
moody?” was found to correlate highly wt aea 
Taylor Manifest Anxiety Scale (1953), and hi adi- 
provides an index of the extent to which an ке 
vidual believes himself to be anxious. The sit оп 
version scale, containing such items as “Woul АЕ 
be very unhappy if you were prevented fron! T lf 
ing numerous social contacts?" correlated Vene 
with the Social Extraversion scale of the Lum 
sota T-S-E Inventory (Evans & McCon 
1957), the correlation being much smaller ec- 
that test's Emotional Extraversion scale. Inston 
tion of the Maudsley Inventory's extrave ial 
items showed them clearly to concern the иг 
rather than the emotional forms of extravers c, 
introversion—i.e, to concern desire for the s of 
pany of others rather than avowed opennes 
emotional expression. ed’ 
While our hypotheses specifically nine 
social extraversion, it was of interest to dete” trae 
the extent to which emotional and social E 
version in fact differed. An index of entes ү 
extraversion, therefore, was also added Ю tra 
test battery, in order that both kinds of Fg 
version would be measured. The Emotional was 
traversion scale from the Minnesota T-S- mly 
used for this purpose, its items being rando] 
interspersed among the manifest anxiety ап he 
extraversion items of the Maudsley test- the 
Eysenck Extraversion scale was preferred Hs ust 
Social Extraversion scale from the T-S-E, 006 its 
the former was developed more recently "ly 
relations with other instruments more thoro" yas 
investigated. The whole untitled questionnaire rt 


concern 


administered during a third class hour à curo 
month after the preference session. The ^ sley 
icism and Extraversion scales of the Май. е 


test and the Emotional Extraversion scale 9" qe- 
Minnesota T-S-E were scored in the ways 
Scribed by these tests' respective authors. 

Ss identified themselves by name on the P 
dures used in each of the three class periods. 


oce 


| 
| 
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Scoring Categories for the Stories 


Each story was scored for presence or absence 
of five variables. These variables may be briefly 
described as follows :5 

т. Motion. This variable was scored if the story 
described movement by the body as a whole or by 
any part of the body from the shoulders down, 
hence excluding head, arms, and hands. The 
activity of dancing, however, was not scored. 

2. Rhythm. Rhythm was scored for any repeti- 
tive action, sound, or adverb phrase such as “back 
and forth.” Verbs concerning progressive move- 
ment over space, such as “dancing” or “walking,” 
were scored for rhythm only if a modifier was 
added indicating that the movement was done with 
rhythmic repetition, such as “he walked with even 
paces.” 

3. Peak. A story was scored for peak if there 
was a rise followed by a decline in the level of 
tension implied by the action. 

4. Penetration, This category was scored for 
imagery concerning forceful pushing through a 
т ant medium or one which was treated as 
resistant in the story. 

5. Symbolic content. The definition of sexually 
symbolic content was based on Gutheil's formaliza- 
tion (1951, Ch. 3) of Freudian dream theory 
(Freud, 19385). With the exception of dancing, 
an item of story content was considered sexually 
symbolic if it appeared on Gutheil's list, and the 
scorer was asked to decide on presence or absence 
of symbolic content in the story. і 

The view that these variables concern symbolic 
sexual arousal depends on symbol interpretation 
assumptions made initially by Freud. Strictly 
speaking, the link from our analysis of the stories 


to symbolic sexual arousal is assumed, not tested, 


by the present study. Те has been experimentally 
confirmed, however, in other studies, of which we 
may cite, for instance, the work of Clark (1955). 
One could, on the other hand, avoid this assump- 
tion, simply consider our scoring categories a5 
indicative of some sort of motivational arousal, 


— 

5 Scoring categories for the first four variables 
were taken from Beardslee and Fogelson (1958), 
except for the decision not to score dancing for the 
motion variable. This decision was based on the 
generally widespread appearance of dancing 1mag- 
1 all stories, due, apparently, to direct associa- 
he nightclub or social сана Шей п 
hich jazz typically is heard. Scoring for the 1 
e eyed from Clark (1955), with two 
changes : instead of scoring for strong E, 
Some presence, OF absence of symbolic sexual con- 
tent, а two-category judgment of presence 
absence was made. Secondly, content n 
dancing was not scored, for Я the reason jus 
described. More detailed descriptions of the scoring 
categories than those presented are available upon 


request from the senior author. 


ery it 
tions to t 


and interpret the relationships obtained between 
personality and arousal from this starting point. 
In particular, one might provide an interpretation 
in terms of such a motivational dimension as 
emotionality-rationality, or  subjectiveness-objec- 
tiveness. No evidence within the present research 
overrules this possibility. But since such an ap- 
proach would render the obtained relationships 
hard to understand, and indeed would not have 
generated the theoretical propositions which 
formed the impetus for our hypotheses and experi- 
ment, we wonder at its advisability here; especially 
since the assumption of symbolic sexual arousal is 
rendered tenable by such independent experimental 
evidence as the study just mentioned. 

Two points internal to the present experiment 
provide further support for the assumption, The 
symbolic content category is very hard to explain 
in terms other than sexual symbolism, and its 
trends across musical selections are consistent with 
those of all four remaining scoring categories (as 
presented in detail in the following section). The 
relevance of these five variables to sexual arousal 
is also indicated by the general correspondence 
between the scores they yield for the sexual 
arousal value of each piece and independent ex- 
plicit judgments of these pieces' relative value for 
such arousal (also presented in detail below). 

Since there were three stories per person, each 
person's total score could vary from zero (absence 
of all five variables in all three stories) to 15 
(presence of all five variables in all three stories), 
the possible score for a single story ranging from 
zero to five. All the stories were scored "blindly" 
by two independent scorers. As the total number 
of Ss writing stories was 98, there were 294 stories 
in all, The stories were assigned the numbers one 
to 204 in a random order, each story appearing on 
a separate piece of paper and being identified to 
the scorer only by this number. A scorer thus had 
no idea which stories were written by the same 
individual, nor which stories were written to each 
of the three musical selections. A number code 
indicating which three randomly assigned numbers 
corresponded to the three successive stories for each 
individual permitted the eventual ordering of story 
scores in terms of individuals and musical pieces. 


RESULTS AND Discussion 


Independent Judgments of Arousal Value 


Table 1 presents the rank ordering of 
our three musical selections for sexual 
arousal value, using a sample of Harvard- 
Radcliffe undergraduates as judges. The 
analysis in columns four to six is based on 
the mean difference between the ranks given 
each pair of pieces, the two-tailed proba- 
bility level in each case indicating the likeli- 
hood of obtaining this great a deviation 
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TABLE 1 


INDEPENDENT JUDGES’ MEAN RANKINGS OF THE MUSICAL SELECTIONS FOR SEXUAL AROUSAL VALU А 


— A —— PRG 


Mean Rank Test of Mean Difference between Ranks 
= | | " 
Selection | | d A | m 
| & B б ا‎ 
i p< | t p< | i 
Men 2.8 19 13 | 2.62 .05 | 9.06 ох | — 1.60 
(N = 10) 
Women 2.6 1.7 1.7 2.76 .02 | 2.37 ‚05 0.00 
(№ = 13) | 
| 
Combined 2.7 1.8 1:5 3.64 .01 | и фи | 1.04 
(N = 23) | | | 
| | —— 9 


Note.—A rank of 1 indicates highest arousal value; a rank of 3 indicates lowest arousal value. 


from zero by chance alone. The mean ranks 
in columns one to three are presented to 
show the magnitude of the differences, but 
are not used in further calculations since 
the appropriate statistical test requires the 
use of rank differences for individuals. It 
is apparent that both men and women make 
similar arousal value judgments: for each 
Sex, Selection A being considered least 
arousing, with B and C higher and not dif- 
ferentiated in arousal value. 


Reliability of Story Scoring 


Scoring reliability was determined on the 
basis of the largest possible sample— 
namely, all 98 S's who had the story session. 
Two scorers independently rated the mate- 
rials on each of the five scoring variables 
discussed above. As each $ wrote three 
stories, and as the score for each variable 
consisted in a judgment of presence or 
absence, a scorer made a presence or 
absence rating for 294 stories in the case of 
each variable. The measure of reliability in 
scoring any one variable was the number of 
stories for which both scorers agreed in 
making a presence or absence decision di- 
vided by the total number of stories, or 294, 
This quotient, expressed as a percentage of 
the total, was 84% for motion, 88% for 


It might be noted that one scorer was male 
and the other female. 


rhythm, 88% for peak, 98% for penc 
tion, and 7695 for symbolic content. five 
average reliability of scoring for =. the 
variables combined was 86.8%. One о ee 
scorers resolved those cases where disag” 
ment had occurred. 


"nm 
Stimulus Value of the Music and Arous 


e 

To what extent did the implicit meas 
of arousal obtained from the story M 
of our sample of 98 women reflect the 1 the 
pendent judges’ explicit decisions as ‘lues? 
selections! comparative arousal м 07 
Table 2 presents the findings separately | as 
each of the five scoring variables as Wê first 
for all combined. The means of the the 
three columns indicate that for each E m 
variables, and especially for all of 1 wa 
taken together, symbolic sexual arousa e B 
lowest for Selection A, highest for РИС 
and in between for Piece С. 

Are these trends reliable? The a” th 
was provided in a manner analogous к ed 
used for the independent judges descia all 
earlier. For each variable alone, and о дед 
combined, the difference was deter™ wo 
between an individual's scores for еас {hef 
selections. A t test then indicated W jan 
this difference was significantly d€ Tue? 
from zero. From the obtained t М. 2. 
presented іп columns four to six of Та wee! 
we find the arousal difference bet 


f 
we 
5 at 
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Pieces B and A significant beyond the .01 
or .001 levels for four of the five scoring 
variables and for all combined. The differ- 
ence in arousal between Pieces C and A 
passes the .02 level for two variables and 
the .001 level for a third, also being beyond 
the .001 level for the combination of all 
five. Finally, while the difference between 
Selections C and B is significant for only 
one of the five variables taken separately, 
its significance level for that variable is well 
beyond .01, and the difference for all vari- 
ables combined passes the .05 level. 

The statistical evidence hence suggests 
that the order of increasing arousal value 
for these selections is A, C, B. While 
Piece А was also placed at the low extreme 
of arousal value by our independent judges, 
they did not make a discrimination be- 
tween the two remaining selections but 
rather considered them about equally high. 
On the whole, however, the match was 
quite good between these explicit and im- 
plicit measures of the sexual arousal of 
musical selections obtained from independ- 
ent samples of Ss. e 

In sum, the results just presented indi- 
cate that the music is not incidental to the 
arousal content found in our story produc- 
tions. Rather, such arousal reflects inde- 
pendently made explicit judgments as to the 
differing sexual arousal value of the selec- 
tions. Finally, no cumulative series effect 
seems able to account for the differing de- 


grees of arousal content for stories written 
to the three pieces, since the arousal level 
for the selection heard last was lower than 
that for the preceding selection. 


Techniques of Analysis for 
Personality Effects 


Before turning to the findings on person- 
ality, some general considerations are neces- 
sary concerning the analysis techniques 
used. All the two-way analyses of variance 
reported, whatever the dependent variable 
in questton, involved the use of some two 
of the following four dimensions as the 
independent variables whose effects were to 
be tested: manifest anxiety level, social 
extraversion-introversion, emotional extra- 
version-introversion, and absolute prefer- 
ence level. Each analysis required the con- 
struction of four groups representing the 
following combinations of any two of these 
independent variables: groups that were 
high-high, high-low, low-high, and low- 
low. Accordingly, each variable was dichot- 
omized by splitting the sample into highs 
and lows, the split being as close to the 
median as possible. The split could never 
be exactly at the median because, in the 
case of each variable, there were a number 
of Ss tied for a score that overlapped the 
median. In all four cases the overlap 
around the median was asymmetrical: that 
is, most individuals having the score value 


TABLE 2 
2: d SCORES FOR THE MUSICAL SELECTIONS 
99 ;NTAL SAMPLE’S MEAN AROUSAL 
EXPERIMENTA i 
Mean Arousal Score Test of Mean Difference between Arousal Scores 
ean Ar 
Selection ВА СА CB 
: ү É t p< t p< t p< 
Motoh. 0.26 0.49 0.45 3.45 .001 TE. TE 
Мон 0.13 0.33 0.15 3.36.01 i ns 3.07 01 
i 0.08 0.23 0.21 ss s | 2e 9 0.35 m 
E З 1 1.52 ns : 4 
а Hd ut We а 001 4.06 .001 1.29: ms 
Symbolic Content x & А 
i 0.72 1.69 1.34 6.22 .001 4.09  .001 2.49 .05 
Гоа! .72 . 
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in question fell on one side of the median 
and only a few on the other. 


Because of this asymmetrical nature of 
the overlap, and because of the other fac- 
tors already mentioned which necessarily 
resulted in reduction of sample size, it was 
inadvisable to drop a whole group of cases 
just because they all had a score value 
which fell at the median of a distribution. 
The groups were dichotomized so as to 
yield as nearly equal halves as possible, the 
actual resultant cutting point for highs and 
lows deviating no more than 596 from the 
ideal of a 5096-5096 split. That is, consid- 
ering the four variables that were dichot- 
omized, the extent of the cutting point's 
variation from half-and-half never exceeded 
4596-5596 break, and indeed was closer to 
half-and-half than that in all but one of the 
four splits made. Thus, the four groups 
examined in any analysis of variance repre- 
sented, for the two independent variables 
of the analysis, those individuals above the 
cutting point for both, below it for both, 
and above it for one and below it for the 
other. 

Since two-way analysis of variance with 
more than one $ per cell is very difficult 
unless the numbers of Ss in the various 
cells are equal, the cell size for an analysis 
was taken as the smallest number of Ss 
falling in any one of the four cells. Any Ss 
in excess of that number were removed 
from the remaining three cells by а process 
of random elimination. The variances of 
the four groups constituting each two-by- 
two table described were found to be 
homogeneous by Bartlett’s test (Snedecor, 
1946, Pp. 249-250), thus meeting the as- 
sumptions of analysis of variance technique 
in all cases, Finally, the fixed constants 
model (McNemar, 1955, pp. 304-306) was 
used in the analyses, since none of the 
bases of classification involved any random 
sampling. According to this model, the 
within-cells mean Square is used as the error 
term in calculating F ratios for rows, col- 
umns, and interaction effects, | 


The most deviant break, namely, that of 45%— 
55%, occurred for the variable of absolute prefer- 
ence level. 
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TABLE 3 


Mean LEVELS ОЕ AROUSAL TO Music FoR ANXIETY 


AND PRE ENCE GROUPS 
(N = 13 per cell) 


High Anxiety | Low Anxiety 


High Preference iM 
Low Preference 5 


Manifest Anxiety Level and Social 
Extraversion-Introversion 


We turn now to the experiment's major 
concern. Our hypotheses, it will be recalled, 
postulated an interactive effect of manifest 
anxiety and social extraversion on symbolic 
sexual arousability to music, Before dé 
termining whether this interaction occurret А 
however, it was necessary to investigate 
whether differences in absolute levels p: 
preference for the jazz music as a who 
might act to influence the relationship 08 
tween these personality dimensions ай 
arousal to such music. Accordingly, a? 
ferential arousability first had to be 2556556 
separately in terms not only of an inc И 
vidual's position on each personality dimen 
sion, but also in terms of whether ne 
tended, on the whole, to be high or low in 
her liking for the jazz music heard. А pe 
sonality effect might conceivably be demon" 
strable for individuals liking the mus!“ 
but not for those disliking it. | 

Tables 3 and 4 present the means anc 
analysis of variance for those high and m 
in manifest anxiety level, the Ss also bein 


TABLE 4 


ape FOR 
ANALYSIS OF VARIANCE, AROUSAL TO MUSIC Ё 


ANXIETY AND PREFERENCE GROUPS 


(№ = 52) 23 
Source df Mean F p= 
Square E 
Anxiety 1 40.69 8.71 A 
Preference 1 0.08 | 0.02 Na 
Interaction 1 1.23 0.26 ue 
Within Cells 48 4.67 


ER, 
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TABLE 5 


MEAN LEVELS OF AROUSAL TO MUSIC FOR SOCIAL 
EXTRAVERSION AND PREFERENCE GROUPS 


(N = И per cell) 


TABLE 7 


Mean LEVELS OF AROUSAL TO Music РОК SOCIAL 
EXTRAVERSION AND ANXIETY GROUPS 


(№ = 15 per cell) 


High Social High Social High Social High Social 

Extraversion Introversion Extraversion | Introversion 
High Preference 3.45 5.21 High Anxiety 3.33 6.13 
Low Preference 4.27 3.91 Low Anxiety 3.93 2.67 


of high vs. low level of 
absolute preference for the jazz. The same 
kind of analysis is shown in Tables 5 and 6 
for social extraverts VS. social introverts. 
The preference level scores in each case 
were built as follows. An individual's abso- 
lute preference ratings for each piece on 
the seven-point scale were added to form a 
single combined estimate of preference for 
the set of musical selections as a whole, 
high numerical scores representing а high 
degree of liking for the jazz. The arousal 
level scores, in turn, consisted of an indi- 
vidual's sum score for all three selections 
combined. This is the case wherever arousal 
level scores are presented, unless otherwise 
specified. Our conclusion from Tables 3 to 
6 is that preference level does not, by and 


separated in terms 


large, influence arousal effects. In both 
cases, the preference variable alone 15 
The inter- 


clearly without significance. 
action, in turn, while clearly insignificant 
in the case of anxiety level and preference, 
shows a slight trend, although still being 
insignificant, in the case of social extra- 
version and preference. Since no clear 


TABLE 6 


AROUSAL TO Music FOR 


A {18 OF VARIANCE 
edis 7 ERENCE GROUPS 


preference effects were demonstrable, no 
reason remained for separating our 5$ on 
the basis of preference level. 

Turning to the effects of personality, we 
find a strong relationship between level of 
manifest anxiety and symbolic sexual 
arousal (Tables 3 and 4), those 5$ who 
declare themselves more anxious being 
more arousable to the music. No compar- 
able relationship occurs when one considers 
social extraversion alone (Tables 5 and 6). 
Such a finding is irrelevant to our predic- 
tions, however, since the hypotheses to be 
tested concerned interactive effects between 
social extraversion and anxiety level, and 
made no predictions concerning social ex- 
traversion per se. The necessary method, 
then, given the absence of preference effects, 
was to consider the joint effects of anxiety 
level and social extraversion-introversion 
on arousal. 

As we see in Tables 7 and 8 these effects 
are considerable. Again, anxiety level taken 
by itself is positively related to arousal. 
But most noteworthy is the very strong 


TABLE 8 


ANALYSIS OF VARIANCE, AROUSAL TO Music FoR 
SOCIAL EXTRAVERSION AND ANXIETY GROUPS 


SocraL EXTRAVERSION Амр PREF 
(N = 44) (N = 60) 
F < Source df Mean F p< 
Montes 4 ied i Square 
Soci Social 
REI 8 1 5.82 1.08 ns Extraversion 1 8.81 2.23 п$ 
E | a | Bee oe E Anxiety. 1 [з0 | 7.80 | .01 
I RA Ee 1 13.09 2.42 ns Interaction 1 62.03 15.70 .001 
ue | | 5% Within Cells | 56 | 3.95 
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interaction (significant beyond the .001 
level) between level of manifest anxiety 
and social extraversion-introversion in de- 
termining arousal Social introverts high 
in manifest anxiety level exhibit a greater 
degree of symbolic sexual arousal than 
social extraverts whose anxiety level is the 
same. On the other hand, social introverts 
low in manifest anxiety level show a lesser 
degree of such arousal than do social extra- 
verts of the same low anxiety level. Social 
extraversion-introversion per se had no 
effect on arousal, then, because its relation- 
ship to arousal strongly depends on the S’s 
level of manifest anxiety. For while those 
social introverts high in anxiety level show 
the greatest arousal to the music, social 
introverts low in level of anxiety show the 
least arousal, with both high and low anxi- 
ety social extraverts falling between these 
two extremes, These findings confirm the 
two hypotheses of the investigation. 


Examples of Stories to Music 


To provide a picture of the kinds of 
stories told by members of these four 
groups, here are typical stories by indi- 
viduals of each type. The stories are all in 
Tesponse to the third musical selection—the 
piece intermediate in over-all arousal level, 
: High. anxiety social introvert stories. 
15 stirring a huge pot of bo 
thickly Swells and plops, se 
Which fill the room and hyp: 


А girl is being chased down an 


empty street by 
four men. tied 


into the apartment 


He slaps the horse with the 
whinnying. 


1 goes back to 
This method works and the 


man and horse are confident again. The man tries 
to mount but the horse throws him off. Eventu- 
ally, after a long struggle the horse gives in to 
the man's weight and they are friends. E 

Beautiful woman in green satin dress, half in- 
toxicated, half exhausted from lack of sleep, 
lackadaisically starts climbing flight of stairs to 
her room. Loses balance and falls—dead! Her 
husband returns and sees woman in purple dress— 
doesn't recognize her. Steps over dead body; 
climbs stairs, and goes to bed. He never wakes Шр. 

Low anxiety social introvert stories. This is the 
story of a young man who loved music, He was 
not irom a wealthy or a poor family, He came 
from a small northern town of a middle-class 
family. Of course his parents were against him 
going into the music profession. His father 
been with a well-known band in the late 1920'S. 
had always wanted to go on to play with his od 
band. He never made it, and consequently he wa 
afraid that his son would find himself in the 
same position. 

A college student has the “blues.” The P 
week has been a hard one for him. He flunke 
the exam he had, his roommate got sick, and һа 
to be sent home, he couldn't get а date for the 
big dance tonight. It's а lovely Saturday evening 
as he sits in his room trying to study while al 
his friends are at the dance. t 

A new family moved into the rough section O 
town. They were poor but respectable. The 
mother watched with horror her son grow into а 
rude, troublesome teenager. His talk, when he 
Spoke to her, was of playing "chicken," rock an 
roll, and the gang. 
up smoking. She was sure he drank. His friends 
were tough; dressed in leather jackets with gaudy 
symbols on the back, Every 


the neighborhood: 
Son could never hurt апу” 
him what's right. 

. To play such music in such а way as to make 
it speak, not only does the musician have to have 


or his mother, etc, 
audience or perhaps or 
audience feels the bea 
man hears it in the dis 


His sadness is told to the 
пе lonely member of it. The 
uty of his music, the lonely 
tance, but neither understan 
the powerful emotion behind а player of the blues: 

High anxiety social extravert stories, This 15 
the nonconformist who is ruled by his passions 
and emotions, He won't listen or be put into дЕ 
mold made for him. His only outlet is to expres: 
the freedom of feeling—to cleanse his system 0 
the agitation and bitterness within. He cares do 
so feels driven to show tS 
n objective view. He is аё 


expression is а way of 


S revolt for him. Не h?* 
Something new to say 


i 
and says it regardless ° 
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public opinion and criticism. His thoughts are 
vivid yet rash, but isn't that so of anyone who has 
had a new idea and wishes to convey it? He must 
suffer for his outwardness. 

Ethel welcomed her husband home with a smile 
and words of kindness. She seductively led him to 
the love seat, begged him to make himself com- 
fortable, and then announced that she had a few 
things to say. Poor Bill—he wasn't expecting the 
"torrent of words that rained at him—they were 
more like a continuous blast of thunder. Higher 
and higher, faster and faster went Ethel's voice as 
she accused him of everything from infidelity to 


acts of treason against her flower garden. ВШ 


couldn't say а word—not only could he not talk 
louder or faster, but there was no opportunity for 
him to even begin. Finally she stopped and stood 
staring at him, laughing uncontrollably. 

It's almost as if the trumpet can talk—the player 
is telling the world they're not the only ones who 
have troubles—he's blaring his aloud and baring 
his heart to all—some get saved (increased tempo 
and other instruments) but others keep dragging 
on—the color blue is suggested for it depicts the 
*"plues" and this person has definitely got them! 
He's reassuring though—for one realizes everyone 
is in the same boat. 

I picture a man, hurt yet unable to strike out in 
retaliation. Instead he roams the street, drunk, 
poor, another bum who wishes there was some way 
out, but doesn't have the strength to look for it. 


Low anxiety social extravert stories. In order to 
get to the telephone building in Bowdoin Square 
we had to walk through Scully Square. As we 
were passing ап open door Greg suddenly pulled 
my hand so hard that I had no choice but to fol- 
low him. Inside a very dimly lit room there was 
a jukebox playing, and the men clustered about 
the bar were swaying with the rhythm. 1 tried to 
pull Greg away from the door, but he was fas- 
cinated, more by the people than the music. Sud- 
denly the sound sharpened and I tugged Greg 
away but we walked away quite slowly so we 
would hear the music. 

It is a dark, humid, heavy evening in early 
August when everything seems to be sticky and 
one feels that he just can't do anything. But one 
is just wasting time. Restlessness and boredom— 
where to go and what to do. One rises from the 


` chair in which one was just slouching and gets а 
Go for 


good book—can't concentrate, no interest. 
a ride—stop get a cool drink—visit some friends to 
say hello—return home ànd the same sluggishness 
engulfs the mind and body. What to do and 
where to go—just to release the nothingness that 
the stillness and the calm that the whole day 


Ex sthheld the force of the cruel 

The large trees withhei пе force : 
wind that blew through the woods. Baby Bunny, 
who should have known better, had wandered away 
So interested Was he in the little 
he ground that he hadn't 


hunting hounds. All of a 


from home. 
things he found along t 
heard the baying of the 


sudden, he hears a loud bark, and looking up he 
just stood still—not knowing what to do. The 
hound was also surprised to see such a little fel- 
low out alone on such a cold day—barked again— 
Baby Bunny at that point decided to run—and 
away he went. The hound thought he would have 
a little fun—so chased. . . . 

It was Sunday and all the workers of the farm 
had the day off. Today was a special day for it 
was the annual picnic day. Everyone dressed 
сазиаЙу and came down along the river banks. 
There they all stretched out, opened their picnic 
baskets, and ate as they watched the boats go by 
them. The small boats went along at their leisure, 
in no particular hurry to reach their destination. 
A few larger ones came by, making more noise 
and causing more excitement among the specta- 
tors. As the day drew to a close a group of happy 
people gathered together their belongings to pre- 
pare for the return to the farm. 


Control for Possible Differences in 
Basal Level of Arousal 


Are the strong arousal effects for the 
high anxiety social introverts truly a func- 
tion of listening to the music? Might they 
not simply result from a higher basal level 
of symbolic sexual arousal for these $s? In 
such a case, their verbal output would re- 
flect this high arousal level, whether the 
stimulus be high or low arousal music. 
Several further analyses, however, suggest 
that listening to the music is in fact influ- 
encing this group's arousal level, although 
they also seem to have a somewhat higher 
basal level than the others. 

First, recall our finding, for the sample 
as a whole, that over-all arousal was lowest 
ior Piece A, highest for B, and intermedi- 
ate for C. A similar analysis was done for 
the 15 high anxiety social introverts as a 
separate group, yielding the same results: 
mean arousal levels for these Ss being 1.33 
for A, 245 for B, and 2.33 for C. Once 
again, statistical evaluation was made in 
terms of the difference between each S’s 
arousal scores for any two musical pieces, 
using a t test to determine whether to reject 
the null hypothesis of zero difference. For 
B-A this ¢ value was 2.45 and for C-A it 
was 2.20, both being significant beyond the 
05 level. For С-В the t value was zero, 
since the mean arousal difference score for 
C-B likewise was zero. Thus, the high 
anxiety introverts, like the sample as a 
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TABLE 9 


MEAN LEVELS OF AROUSAL TO SELECTION А FOR 
SOCIAL EXTRAVERSION AND ANXIETY GROUPS 


AND CAROL GREENBERG 


TABLE 11 


MEAN LEVELS OF AROUSAL TO SELECTION B FoR 
SOCIAL EXTRAVERSION AND ANXIETY GROUPS 


(N = 15 per cell) (N = 15 per cell) 
i i i Hi i High Social 
High 5 1 High Social High Social Hig 1 
ЕО ня Extraversion Introversion 
High Anxiety 0.73 1.33 High Anxiety 1.60 nes 
Low Anxiety 0.66 0.66 Low Anxiety 1.66 1. 


whole, show greater arousal to selections В 
and C than to А. 

Second, and more important, recall that 
our analysis of variance demonstrating an 
interaction between anxiety level and social 
extraversion-introversion in determining 
arousal level used an arousal score based on 
all three musical selections combined. Sup- 
pose now we were to carry out three suc- 
cessive analyses of this kind, using arousal 
scores for each selection alone as the de- 
pendent variable. Tf a higher basal level of 
arousal is the only cause of the high score 
for anxious social introverts, then the inter- 
action effect should be significant for the 
first piece—i.e., the one which is lowest in 
arousal level for the group as a whole—as 


well as for the others. If, on the other 
hand, the significant 


groups, that, in other words, the music had 
an effect. 

Tables 9 through 14 present the relevant 
information. We find that the anxious 
Social introverts’ degree of arousal in 
stories to Piece A (Tables 9 and 10) is not 
significantly different from that of the other 
three groups: the interaction F is 1.34. For 
Piece В (Tables 11 and 12), on the other 
hand, this interaction F approaches signifi- 
cance, being 2.85 (passes the .10 level of 
significance) ; and for Piece C (Tables 13 
and 14), this interaction F is highly signifi- 


TABLE 12 


ANALYSIS OF VARIANCE, AROUSAL TO 5кьвстїох P 
FOR SOCIAL ExTRAVERSION AND ANXIETY GROUPS 


t interaction. for all QN = 60) 
three pieces Combined is found to depend 
on the second and third pieces but not on Source df Mean F p< 
the first piece, this will indicate that the Square 
change from low to high arousal music Social 
caused a greater increase in arousal for the p cw 1 1.06 | 0.71 ns 
anxious introverts than for the other Anxiety 1 3.26 247 95 
Interaction 1 | 4.28 | 2.85 | .10 
Р Within Cells 56 1.50 
TABLE 10 
ANALYSIS OF VARIANCE, AROUSAL TO SELECTION A 
FOR SOCIAL EXTRAVERSION AND ANXIETY GROUPS 
(N = 60) TABLE 13 
мг геш OF AROUSAL то SELECTION C а 
SOCIAL ExrRAvE N wx 3ROUP 
Source df Mean F p< p уйне aaa 
Square - per cell) 
Social í jal 
. " : боса 
Extraversion 1 1.88 1.32 ns Ed. eS n В парня 
Anxiety 1 2.00 1.96 ns — oe = 
Interaction 1 1.37 1.34 ns Higl iety 
Within Cells | 56 | 102 i d et 2:60 


- 
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TABLE 14 


ANALYSIS OF VARIANCE, AROUSAL TO SELECTION C 
FOR SOCIAL EXTRAVERSION AND ANXIETY GROUPS 


TABLE 15 


MEAN NuMBER ОЕ WORDS IN STORIES TO MUSIC FoR 
SOCIAL EXTRAVERSION AND ANXIETY GROUPS 


(№ = 60) (N = 15 per cell) 
Source df Eu F p< High Social High Social 
quare Extraversion | Introversion 
Social High Anxiety 3 
Anxiety 211.3. 9! 
Extraversion 1 1.06 0.66 ns Low Anxiety 212. 67 up 
Anxiety 1 | 5.40 | 3.37 08 hci 
Interaction 1 21.60 13.50 001 
Within Cells 56 1.60 š 
In sum, the heightened arousal for 
anxious social introverts seems indeed to 
am (= 1285; which passes the 2001 be an effect of listening to the music. It 
( SEMI a в does not significantly occur for the music 


level). Thus, the greatest tendency for our 
anxious social introvert group to exceed the 
others in arousal occurs for Piece C—the 
piece which, we recall, is intermediate in 
arousal value for the sample as a whole. 
The results for Selection B—the piece 
which is highest in overall arousal value— 
show a trend in the same direction. Finally, 
no such trend occurs in the case of the 
music lowest in arousal value, Piece A. 
The difference in results for Pieces В 
and C becomes understandable from a con- 
sideration of the cell means in Tables 11 
and 13, respectively. In both cases, the 
arousal level for high anxiety social intro- 
verts is 2.40, and that for low anxiety social 
extraverts is in the neighborhood of 1.50. 
The fact of Piece B's being highest in gen- 
eral arousal value pushes up the mean 
arousal for the other two cells of Table 11, 
whereas these cells remain low for Piece C 
the selection intermediate in overall 
arousal value (Table 13). The stimulus 
value of the music with regard to arousal 
level, then, has its greatest influence for the 
pieces that are lowest and highest in arousal 
value—Selections А and B, respectively. 
has its greatest chance to influ- 


Personality г 1- 
ence arousal when, stimulus-wise, the на 
has an intermediate degree of arousa 


in the case of Piece С. 
hose arousal reactions 
because of person- 
ng. Those cells, on 
sonality would 
show a 


value: namely, 
Here, the groups W 
to the music are strong 
ality factors remain stro 
the other hand, where perso 
lead to weak arousal reactions, 
drop. 


lowest in arousal value, but appears for the 
other selections. 


Control for Possible Differences in 
Verbal Facility 


А Опе might argue that the greater arousal 
for high anxiety social introverts is not due 
to group differences in the tendency to use 
sexual symbolism in one's stories, but rather 
simply to differences in story length. That 
is, the chances might be greater for sexual 
symbolism to appear in the stories of 
anxious social introverts simply because 
they write longer stories. To evaluate this 
point, the number of words in all three 
stories combined was calculated for the 
members of the four groups under discus- 
sion. Table 15 presents means for these 
groups and Table 16 an analysis of vari- 
ance, indicating that no significant varia- 


TABLE 16 


ANALYSIS OF VARIANCE, NUMBER OF WORDS IN 
STORIES TO Music For SOCIAL EXTRAVERSION 
AND ANXIETY GROUPS 


(N = 60) 

Source df Mean F p< 

Square 

Social 

Extraversion 1 11.26 0.33 ns 
Anxiety 1 59.73 1.74 ns 
Interaction 1 69.54 2.03 ns 

Within Cells 56 34.31 


Note.—To facilitate computing the analysis of variance, 
individual scores were divided by 10. 
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tions in story length differentiate our 
groups. Comparable analyses of variance 
done for the length of first, second, and 
third stories separately likewise showed no 
significant variations in story length—no F 
value exceeding the largest one in Table 16. 
The stories written by high anxiety social 
introverts, then, were not significantly 
longer than those by members of the other 
three groups. Some tendency of this kind 
may be operating, however, since—as shown 
in Table 15—the means are in the expected 
direction, although not significantly so. 


Agreeing Response Set 


It should be noted that all of the Mani- 
fest Anxiety scale items (the Maudsley 
Neuroticism scale) are keyed in such a way 
that “yes” responses cause a high anxiety 
Score, while two-thirds of the social extra- 
Version items (the Maudsley Extraversion 
Scale) are keyed in such a way that “yes” 
responses cause a high social extraversion 
Score. If an agreeing response set (Cohn, 
1956; Couch & Keniston, 1960) were oper- 
ating in our sample, one hence would ex- 
pect a significant positive correlation be- 
tween the two scales, since a person scoring 

igh in manifest anxiety would score in the 
middle 9n social extraversion. But the cor- 
relation between scores on the Manifest 

Xiety and Social Extraversion scales was 

-15, which is nonsignificant for the sample 
of 88 for whom scores were available, and 
even negative in direction. 
. more important indication th 
ing response set doesn’t exert а potent influ- 
ence on arousal is the complicated relation- 
ship between arousal, social extraversion— 
introversion, and manifest anxiety. The 
most “yes” responses are given by high 
anxiety extraverts, and yet they fall in the 
middle on the arousal distribution, being 
excelled by Ss (high anxiety introverts) 
who give more “no” responses. 

These points argue against the influence 
of a “yea-saying” response set on arousal. 


at agree- 


Motivational Differences Other Than in 
Symbolic Sexual Arousal 


It is clear from the illustrat 


ive stories 
presented above that aspects of 


motivational 


content other than symbolic sexual arousal 
also may differentiate our personality 
groups. The low anxiety social introverts, 
for example, would seem to be deniers ог 
repressors of affective involvement. Their 
stories strive to present as literal, descrip- 
tive renderings of the music as is possible. 
Further, the high anxiety social extraverts 
seem to be individuals whose contacts with 
other people are covertly dominated by the 
hostile wish to hurt them—as expressed in 
story fantasy. It is possible that while the 
main motivational effect of music for high 
anxiety social introverts is to elicit covert 
sexual expression, music's main motiva- 
tional influence for high anxiety social ex- 
traverts is to stimulate covert expression О 
hostility. In each case, the music woul 
permit displaced, implicit expression of а 
kind of motive which is denied outlet at the 
overt level: the anxious social introvert in- 
hibiting sexual needs for others, the anx- 
ious social extravert inhibiting aggressive 
feelings toward others and counterphob- 
ically insisting that he likes people. 1 
This points toward the idea that verba 
expression of social extraversion, if accom- 
panied by high manifest anxiety, may be 
just as indicative of the failure of a mode 
of social adaptation as the accompanying 0 
social introversion by high anxiety. The 
anxious social extravert, rather than want- 
ing to be with other people, really may hate 
them! Further exploration of this poss! 
bility must wait upon continuing research: 


Emotional Extraversion-Introversion 
The correlation 
version and emotior 
Was .16, which is 


sample of 88. Thes 
seem to be me 


between social extr" 
nal extraversion score? 
nonsignificant for ОШ 
€ two scales do in #26 


ә asuring different things. 3 
borderline Correlation, in turn, exists D€ 


tween the Emotional Extraversion а? 

Manifest Anxiety scales: the r of .23 pass, 
ing the .05 level with 88 Ss. As one wea 
might expect, the tendency to be genera p 
expressive of one's feelings (emotional €* 
traversion) has some relationship with © 

tendency to express feelings of worry ай, 


er : : gi- 
pue m particular (manifest an* 
ety). 


——— 
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TABLE 17 


MEAN LEVELS ог AROUSAL TO Music FOR 
EMOTIONAL EXTRAVERSION AND 
PREFERENCE GROUPS 
(N = 11 per cell) 


High 
Emotional 
Introversion 


High 
Emotional 
Extraversion 


High Preference 


4.36 
Low Preference 5.18 


a3 


3.2 
3.2 


This last correlation indicates a need to 
investigate whether the emotional extra- 
version dimension behaves at all like mani- 
fest anxiety level in its relationships to 
symbolic sexual arousal. The relevant in- 
formation is presented in Tables 17 through 
22. Contrasting emotional extraverts and 
introverts with high or low general prefer- 
ence for the jazz selections, we find from 
Tables 17 and 18 that the same general 
effect occurs as in the case of manifest 
anxiety, although with less intensity. Again, 
preference level makes no difference, and 
emotional extraverts tend to be more 
aroused to the music than emotional intro- 
verts. When one pits anxiety level and 
emotional extraversion against each other, 
however (Tables 19 and 20), we find a 
clearly significant effect occurring only for 
the former. Finally, contrasting emotional 
and social extraversion (Tables 21 and 22), 
we find a trend toward significance for the 
interaction effect (р < :07) : those who are 


TABLE 18 


ANALYSIS OF VARIANCE, AROUSAL TO MUSIC FOR 
EMOTIONAL EXTRAVERSION AND PREFERENCE 


TABLE 19 


MEAN LEVELS OF AROUSAL TO MUSIC FOR 
EMOTIONAL EXTRAVERSION AND 
ANXIETY GROUPS 


(N = 13 per cell) 


High High 
Emotional Emotional 
Extraversion | Introversion 
High Anxiety 5.92 4.23 
Low Anxiety 3.46 3.23 


emotional extraverts and social introverts 
being highest in arousal level. These are 
individuals who claim that they are emo- 
tionally expressive but yet have declared 
that they keep apart from other people. We 
recall that this same interaction is highly 
significant when anxiety level and social 
extraversion-introversion are contrasted. 
These findings suggest that manifest 
anxiety level and emotional extraversion- 
introversion have something in common— 
namely, the tendency to express one's vari- 
ous feeling states. But the expression of a 
particular kind of feeling state—namely, 
anxiety, worry, and depression—seems 
most directly related to symbolic sexual 
arousal, since we obtain much stronger 
arousal effects when varying anxiety level 
than when varying emotional extraversion. 
This is consistent with what our theory 
leads us to expect, since a defense's success 
or failure was defined in terms of its rela- 
tion to a high or low level of anxiety in 


TABLE 20 


ANALYSIS oF VARIANCE, AROUSAL TO Music FOR 
EMOTIONAL EXTRAVERSION AND ANXIETY GROUPS 


GROUPS 
(N = 44) (N = 52) 
Source df Mean F | p< Source | df Mean F p< 
Square | Square 
ee `— 
Emotional | Emotional 
Extraversion 1 | 24.75 | 5.04 .05 Extraversion 1 12.02 | 2.71 ne 
Preterence р 1 1.84 | 0.37 ns Anxiety 1 | 38.94 | 8.79 01 
Interaction 1 1.84 | 0.37 | ns Interaction 1 6.94 | 1.57 zn 
Within Cells 40 4.01 Within Cells 48 4.43 
| 
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TABLE 21 


MEAN LEVELS ОЕ AROUSAL TO Music For SOCIAL 
EXTRAVERSION AND EMOTIONAL EXTRAVERSION 
Groups 
(N = 17 per cell) 


High Social High Social 


Extraversion Introversion 
High Emotional А 
Extraversion 3.53 5.41 
High Emotional 
Introversion 8.71 3.59 


particular. And failure or success of the 
introversion defense, in turn, was the cru- 


cial determinant of strength of symbolic 
sexual arousal. 


Sex Differences in Arousal 


Finally, our results seem to shed light on 
Beardslee and Fogelson’s (1958) finding 
that female undergraduates tend to be 
higher in symbolic sexual arousal to music 
than do male college students. If the 
Strongest determinant of such arousal is 
fantasy displacement of sexual needs that 
are hindered from more direct expression, 
then the contrasting social norms in our 
culture regarding the sexual activities of 
unmarried men and women would lead 
to expect the result reported. 
societal prohibitions against gir 
Sexual expression should lead t 


us 
The stricter 
Is for direct 
O à stronger 


TABLE 22 


ANALYSIS OF VARIANCE, AROUSAL TO Music ron 
SOCIAL ExTRAVERsION AND EMOTIONAL 
ExTRAVERSION Groups 


(N = 68) 
Source df Mean F p< 
Square 
Social 
Extraversion 1 13.24 2.81 ns 
Emotional 
Extraversion 1 11.53 2.44 ns 
Interaction 1 17.00 3.60 07 
Within Cells 64 4.72 


need in the case of unmarried women than 
men for displacement arousal in response 
to music. 


SUMMARY AND CONCLUSIONS 


A study was carried out to test the fol- 
lowing theory about music and резни 
(а) One function of listening to music 8 Е 
permit the symbolic expression of sexua 
impulses. (b) The strongest impetus "i 
ward such expression arises for person 
who claim to be socially introverted, but 
whose attitude of introversion does um 
constitute a successful defense as indicatet 
by the presence of a high level of pon 
Since the need for others remains and ne 
its direct sexual expression has been iil 
hibited, the only open path is that of [p 
displacement. (c) A weaker impetus ү 
ward such expression arises for those zd 
picting themselves as socially extraverte" 
simply as a reflection of their social involve 
ment. Since direct sexual expression is St! 
an available channel, however, gy 
expression is not the only permitted rout 
and hence should be less intense than in the 
first case. (d) The weakest impetus towar” 
Such expression exists for those whose "i 
tude of introversion is a successful mode 
of defense as indicated by the absence 07 
anxiety, since such success means ther 
need for other persons really has been over” 
come. Hence, a minimum of symbolic ех 
pression is required. » 

Two hypotheses were derived from this 
theory: (a) Among those high in admitte" 
anxiety level, 55 describing themselves ая 
socially introverted will show a greater 
degree of symbolic Sexual arousal to music 
than those calling themselves socially extra 
verted. (b) Among those low in admitte 
anxiety level, Ss calling themselves social!) 
introverted will show a lesser degree О! 
Symbolic sexual arousal to music than tho5" 
considering themselves socially extraverte" 

To evaluate these hypotheses, an expe" 
ment was devised involving assessment О! 
Symbolic sexual arousal from stories r 
spired by listening to music. The results: 
using female undergraduates as Ss, strong!) 
confirmed both predictions. A number 0 


bis 
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points of interpretation were treated in the 
course of describing the procedures and 
results, including the following: the as- 
sumed relevance of the story-scoring cate- 
gories to symbolic sexual arousal was con- 
sidered. Strong relations were found 
between the independently judged sexual 
stimulus values of the musical selections 
and their respective symbolic sexual arousal 
potentials as inferred from story content. 
'The possibility was examined of accounting 
for the results as due to differences in basal 
level of arousal rather than in the effects 
of the music, and it was concluded that the 


music itself was the influential factor. In 
addition, the obtained results were not 
found to be due to group differences in 
verbal facility, nor could they be the prod- 
uct of an agreeing response set. Comments 
were made on motivational differences other 
than in symbolic sexual arousal. Finally, 
relations to emotional extraversion and an 
interpretation of sex differences in symbolic 
sexual arousability also were discussed. 

The results of the present research seem 
to suggest the fruitfulness of further work 
on music’s motivational functions for the 
individual. 
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Т is one of a series of studies in 
which various relations between psy- 
chological events have been examined as 
determinants of attitude and attitude 
change. The opposites relation is of interest 
for a number of reasons. In the first place, 
scattered bits of evidence (Woodworth, 
1938; p. 344; Riess, 1946; Kreezer & Dal- 
lenbach, 1929) suggest that the opposites 
pattern may emerge as a result of the dif- 
ferentiation of a similarity dimension and 
consequently may involve greater distances 
between parts of the structure than do less 
differentiated similars. The nature of gen- 
eralization within an opposites structure has 
been explored by one of us (Peak, 1958b, 
1958c). 

The opposites Асер: has also been in- 
voked in discussions of personality char- 
acteristics, most particularly as an aspect of 
the authoritarian syndrome which has been 
said to involve extreme tendencies to di- 
chotomize (Adorno, Frenkel-Brunswick, 
Levinson, & Sanford, 1950). Again, the 
structure 15 1M- 
plicit in the descriptions of certain defense 
mechanisms. For example, projection, de- 
fined as the attribution of characteristics or 
feelings possessed by an individual to рег- 
sons and objects in the outside world, prob- 
ably requires а dichotomization of the con- 
cepts of the self and certain others, for it 
should be easier to blame others with im- 
punity if they are highly differentiated from 
the self. Reaction formation or reversal, 
involving the overt expression of an act or 
feeling opposite the one being defended 


Е Р 
1 This investigation was carried out under a 


tween the Office of Naval Research 
E не А. of Michigan, Project Nonr 
1224 (10) NR 171-039. The senior author was in 
residence at the Center for Advanced Study in the 
Behavioral Sciences when this was written. 
* 


against, also required examination from this 
point of view. 

A pretest and an earlier study (Peak 
1959a) indicated that certain measures as- 
sumed io reflect opposites structure tended 
to be positively correlated. The pretest also 
supported the notion that a preference for 
projection was positively related to oppo- 
sites measures but that reversal or reaction 
formation preference, contrary to expecta- 
tions, had a negative relation to these meas- 
ures. The investigation to be reported 
sought additional evidence of a general 
tendency to dichotomize and examined the 
relations of opposites measures to certain 
defense preferences, to ingroup, outgroup, 
and self-attitudes, to scores on the Cali- 
fornia F Scale, and to other variables, with 
the expectation that methods of dealing 
with psychological distance might prove to 
be a common determinant of many such 
psychological processes. 


Definition of Opposition 


Similarity has usually been defined as a 
relationship in which events share one or 
more common properties. Opposed events, 
like similars, also have common properties, 
the principal difference residing in the 
greater psychological distance between op- 
posites, a distance which has become suffi- 
ciently large to produce subcategories within 
the common dimension. But it should be 
noted that such events are opposites only 
as long as the emerging subcategories retain 
positions on the same dimension. Opposites 
must be alike in some degree. For example, 
clean and dirty may in some cases be seen 
as possessing different qualitative proper- 
ties (subcategories), though from another 
point of view they are simply different 
amounts of the same property. Again, lik- 
ing and disliking share the property of 
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affect which appears to be structured as a 
bipolar continuum passing through a zero 
point representing no affect or evaluation. 
Presumably, any continuum made up of 
degrees of some property may under cer- 
tain conditions become sufficiently differ- 
entiated so that its ends come to constitute 
subcategories and thus an opposites struc- 
ture, provided some relation remains be- 
tween the dichotomies.? 


Defenses 


In the following pages frequent refer- 
ence will be made to the defenses of projec- 
tion and reaction formation or reversal. 
These concepts have been assigned many 
different meanings in the literature. In this 
discussion they will be used to refer to 
processes of the kind described on D. 1 
and identified by the Blum Defense Pref- 
erence Inventory. The extent to which 
the results obtained here are generalizable 
to other measures of projection and re- 


versal can be determined only by further 
observations. 


Hypotheses 


The following hypotheses were tested : 


1G Different measures of opposites struc- 
ture will tend to be positively related. Since 
no psychological characteristics are ever 
completely general over all situations, it 
was not expected that all such correlations 
would be significant. It was rather sup- 
posed that the correlations would be higher 
as the methods of measurement and the 
nature of the situations involved in the 
indices became more similar, For example, 
there should be a higher correlation between 
two indices based on deviations from the 
midpoint of a Semantic Differential scale 
than between two derived from such a scale 
in one case and from opposites associations 
in the other. Furthermore, similarity in 
content of the indices should also be related 


2 These relations are discussed in detail in 
another paper (Peak, 1958a), including the fact 
that under certain conditions the reactions to the 


terms of an opposites structure may cancel each 
other. 


to the size of the correlation between them. : 
However, in the absence of рге-ехрегі- 

mental commitments about the degree of. 
similarity of each index to the others, no 

predictions were made about the relative 
amounts of correlation between specific 

measures here employed. 

П. Preference for projection will be 
negatively correlated with preference for 
reversal or reaction formation. This rela- | 
tion has been observed in an earlier study 
by Goldstein (1952) and more recently 1" 
our pretest. 

IIIa. Preference for projection will be 
positively related to a tendency to opposites 
structuring. 

IIIb. Preference for reversal will. be 
negatively related to opposites structuring: i 
We have already noted the rationale 401 
Hypothesis Па ; the prediction of IIIb yee 
made on the basis of pretest results without | 
a clear understanding of why this shou! 
be so. This will be discussed later. 

IV. Those who prefer projection 
have less favorable attitudes toward the o, 
group and more favorable attitudes towa" 
the ingroup than those who do not pref" 
projection; 1.е., there will be a negative on 
relation between preference for project?" 
and outgroup attitude and a positive cor. 
lation between ingroup attitude and proje 3 
tion. Furthermore, projection preferente 
and difference in ingroup and outgroup a 1 
tudes will be positively correlated. 1 

The greater negativity toward the 0% 
group and more favorable attitude towar 
the ingroup would follow from the famil 
assumption that a person's hostility iS di n 
placed from the ingroup to the ошто 
which is at once hated and blamed i 
hating. The blame is said to be a ratio? 2: 
ization to the effect that if outgroup теш 
bers are hostile it is justifiable to retur? gr 
hostility. Since the distance betwee? ре 
group and outgroup attitudes may (o 
thought of as a measure of the tendency {ef 
dichotomize these concepts, the 81 fof 
mgroup-outgroup difference predicted. 15 
the projector would be expected, for UP 
casier to express hostility to the out” т 
with impunity if the outgroup and ingt’ 
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will | 


| 
| 
| 
| 
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аге clearly separate. It will be suggested 
later that the increased distance between in- 


.group and outgroup for projectors, as well 


as the reversal results, could be explained 
in terms of a style of dealing with disparity 
or distance which is the basis of a motive 
system. 

V. Those who prefer reversal will have 
more favorable outgroup attitudes than 
those who do not prefer it; there will be а 
positive correlation between outgroup atti- 
tudes and reversal preference. 

Since this defense is defined as involving 
responses in a direction opposite to those 
which are being defended against, it was 
expected that positive attitudes toward the 
outgroup would accompany high reaction 
formation preference. Positive attitudes are 
predicted as defensive reactions because 
negative attitudes toward outgroups are as- 
sumed to be more likely to require defen- 
siveness and reversal. 

VI. Women will tend to prefer reversal 
as a defense more often than will men and 
men will prefer projection more often than 
will women. This tendency was observed in 
the pretest and the suggestion was made 
that such a difference might result from the 
greater social penalties attached to the overt 
aggressiveness that tends to accompany 
projection of hostility, while the more posi- 
tive responses resulting from reaction for- 
mation defenses in a hostility situation 
would be the more approved pattern of 
feminine behavior. 


PROCEDURE 


Undergraduate students at the University 
of Michigan served as subjects (55). The 
data were collected during two one-hour 
sessions approximately one week apart. 
Men and women were tested at differ- 
ent times because of the difference in 
standard procedures on the Blacky Test for 
the two sexes. During the first session these 
tasks were performed : The Blacky Test, = 
cluding the Defense Preference Тау ей 
the Kent-Rosanoff free association test, the 
California Е Scale, and Ше їп ше 
sheet. At the second session Ss continued 
with the free association test, They sorted 

> 


108 statements about Negroes, judging the 
degree of favorableness which the state- 
ments represented. And finally they rated 
24 concepts on Semantic Differential scales. 


Blacky Test 


Standard procedures were followed in admin- 
istering Blum's Blacky Test? and Defense Prefer- 
ence Inventory (Blum, 1950, 1956, 1957; Blum & 
Hunt, 1952). Ss wrote a story about each picture 
after the projection of that picture on the screen. 
АП stories about the pictures were completed be- 
fore the Defense Preference Inventory was given. 
This involved exposing each picture a second time, 
and asking Ss to rank five statements in the order 
of their appropriateness to the picture that had just 
been shown. Each of these standardized statements 
was intended to reflect a preference for either 
projection, reaction formation, avoidance, regres- 
sion, or intellectualization. The statements refer- 
ring to a given picture were ranked before the 
next picture was presented. A man administered 
the test to the men students; a woman, to the 
women students. 

Conflict scores were derived from the stories 
which were coded by Blum’s procedures. This 
coded material also provided data for an index of 
hostility. This was a count of the number of 
hostility items in responses to all the pictures.* 
Hostility scores were also expressed as a percent- 
age of the total "ideas" written in the stories. 
Rules were made for classifying all the items in 
the stories into unit ideas. The coder had no 
information about the hostility or conflict scores of 
the S whose protocol was being coded, nor was 
she familiar with the Blacky scoring method. The 
number of these idea units in all the stories was 
counted and divided by the number of stories. 

It was planned at the outset that two DPI (De- 
fense Preference Inventory) scores would be used 
in analyzing the data. (a) An average rank of all 


3 We are grateful to Professor G. S. Blum for 
his advice on the use of the Blacky Test and to 
Mrs. Blum for training one of us to score the 
projective stories on which hostility scores and 
conflict scores were based. 


4 Blum's General Hostility Pattern (1957, р. 12) 
was modified for this purpose by Barbara Muney 
and the hostility score included items from the 
stories which fell into these classes: overtly ex- 
pressed oral hostility, hostile interaction with par- 
ents in an anal context, resentment against the 
parents for lack of attention in oedipal context, 
readiness to experience anger or to engage in ag- 
gressive behavior in response to discipline, overtly 
expressed hostility, feelings of rejection in com- 
parison with sibling, attraction to the sibling 
rivalry situation, and certain denial statements 


implying hostility. 
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the statements reflecting any one class of defense 
in response to all pictures. Thus, the reversal score 
was based on the average rank of the reversal 
statements, the projection score on the average of 
all projection ranks, etc. Because of an error in 
the order of DPI statements for two of the pic- 
tures (X and XI) in one section of women stu- 
dents, total DPI scores were derived from nine of 
the eleven pictures. (b) An average of the ranks 
of each class of statement in response to those 
pictures which Blum identified as hostility situa- 
tions. These hostility pictures were selected be- 
cause it was thought likely that defense preferences 
based on hostility situations would have a clearer 
relation to ingroup-outgroup attitudes than would 
responses to the other pictures. This proved to be 
the case, although the direction of results was the 
same for both DPI indices. The major part of the 
analysis reported at this time is based on DPI 
scores derived from the hostility pictures, though 
reference will be made to certain relations involv- 
ing DPI scores based on high conflict pictures. 
This is the third method of dealing with Blacky 
and one which Blum (1954) has employed. 


Association Test: 1 


One hundred words were projected onto a screen 
one at a time, and Ss were instructed to write 
down on a sheet of paper the first word that came 
to mind. Approximately four seconds intervened 
between the exposure of each word. The list on 
this day consisted of the first 75 Kent-Rosanoff 
words plus 25 other words which were the same 


ones rated on Semantic Differential scales (see 
below). 


California F Scale 


Thirty items were administered 
manner, but the reported results are 
ego-defensive items (Katz, McClintocl 


in the usual 
based on 21 
k, & Sarnoff, 


5 These were Pictures IT Ill 
M в П, ; IV, VII, and 


5 After completing the analysis with these two 


DPI indices, two further indices were devised in 
order to discover for future use whether there 
were more effective ways of using DPI measures 
The first supplemental method involved averaging 
the DPI ranks for those pictures on which а given 
5 had a conflict score equal to or greater than two. 
The second supplementary score was based on the 
three pictures having the highest correlations with 
а criterion measure, the difference in ingroup minus 
outgroup ше e y 
ntercorrelations of all items ан i 
ONR Technical Report No. 7 ( pen Ше 
Вой supplementary scores yielded results in the 
same direction as DPI indices (a) and (b), but 
they had generally somewhat less significant rela- 
tions to other variables. | 


1957), leaving out items placed by the authors of 
the scale (Adorno et al, 1950) in categories of 
"authoritarian submission" and "conventionalism. 
In those cases where both forms of the F scale 
were correlated with the same variables only small 
differences were obtained. High scores mean high 
F tendencies. 


Association Test: 2 


A second free association test, administered A 
the beginning of the second session, included th 
last 25 words of the Kent-Rosanoft list which ha 
not been given the first day. J 

The association responses were scored as opp 
sites, complements, and others. A score based no 
the total number of opposites associations (0 g 
Kent-Rosanoff list and to 25 additional words 
reported. 


Sorting Statements about Negroes 


One hundred and eight statements about Negi 
were sorted into nine piles on the basis of Ve 
ments of the extent to which the statements “Phe 
favorable or unfavorable toward the Negro fol- 
Hovland and Sherif (1952) procedure was ere 
lowed except that 9 instead of 11 categories Wi g 
used. A bimodality index was determined by d 
ing the number of items placed in catego rng 
1+2+8+9, dividing the sum by 4 and subtract} 
the sum of statements in categories 3+ e»t 
divided by 5. 

An analysis of the mean frequency of 
sorted into each of the nine categories by 53 ibu 
bimodality scores in the upper half of the distr the 
tion and by those in the lower half showe TH 
expected bimodal pattern in the first grOUP. «ito, 
more items in end categories than in the Puce 
The persons with low bimodality scores РГ 
rectangular distributions. 


items 
with 


Semantic Differential R atings 


sag 1 
The last task in this session involved тайы] о 
Concepts on 12 scales (Osgood, 1955). de 


concepts included а neutral filler word 8 


eat‏ ت 

3 Seven Statements were added to the test p 
the beginning, phrased in such a way that dise Р jar 
ment would be expected of those with authori g^ 
tendencies. This was done in order to disco ip 
а tendency to answer all questions simply bY sev 


a Set for or against the statements. These 
items were not scored, 


* The scales were labelled with these descri de 
terms: loud-soft, bad-good*, small-large #1, uel 
narrow#, awful-nice*, pungent-bland, pull? yay 
distasteful-tasty*, deep-shallow #, towar agit 
from, angular-round, fair-unfair*. Eva aste 
Scores were based on scales marked with a", 
isk; potency scores on scales marked with + 


iv? 


—M" 
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beginning, the six concepts from which ingroup 
and outgroup attitudes were derived (NEGRO 
ROOMMATE, NEGRO NEIGHBOR, FOREIGNER, FRIEND, 
FAMILY, AMERICANS), the ME concept plus the fol- 
lowing: DANGER, SAFETY, CLEAN, DIRTY, HONESTY, 
DECEIT, MASTER, SERVANT, THRIFTY, WASTEFUL, SUC- 
CESS, FAILURE, DOMINANT, SUBMISSIVE, ENEMY, 
EARLY. A random sequence of rating these con- 
cepts was used. Indices derived from these meas- 
ures will be described with the results. 


Aptitude Measures 


Scores on the 1949 edition of the American 
Council of Education Tests were available for 
most of this sample. The tests had been given by 
the Michigan Bureau of Psychological Services 
early in the academic career of most of the stu- 
dents. A large proportion of our sample was from 
the Sophomore class at the time of the observa- 
tions and had taken the test the year before. 


RESULTS 


Aptitude as a Variable 


The aptitude scores of subgroups result- 
ing from dividing the sample simultane- 
ously on projection and on reversal scores 
are found in Table 5. There were no reli- 
able differences in the aptitude scores of the 
subgroups when men and women were 
treated separately. Since the mean aptitude 
scores of men and women were not signifi- 
cantly different, they have been combined in 
order to test the difference in the aptitude 
scores of DPI subgroups.” It will be seen 
in Table 5 that those who had high scores 
on both projection and reversal (the high 
projection-high reversal group) had a sig- 
nificantly lower aptitude mean than did the 
low-low group; and the low projection- 
high reversal group had lower aptitude 
scores than the low-low group. This sug- 
gests à negative correlation of aptitude and 
formation. However, since there 
was not a significant difference in mean 
aptitude Scores of the high projection-low 
reversal and low-high groups, it may be 
concluded that aptitude was probably not 


reaction 


of .10 or less are reported in all 
indicated, these refer to 
two-tailed probabilities. When variances were nat 
homogeneous, degrees of freedom of f ep ye 
determined by the method described in Dixon an 


Massey (1951, рр. 104-5): 


9 Probabilities ( 
cases, Unless otherwise 


involved in comparisons of the differential 
effects of reversal (reaction formation) and 
projection.'? 

The significant relations (Pearson corre- 
lations) of aptitude with other variables 
were for men: ingroup attitude (—.48), in- 
group attitude minus attitude toward self 
(—.30), the total number of ideas ex- 
pressed in the Blacky stories (.31), and 
ingroup deviation scores (—.34). For 
women, correlation with the total number 
of ideas was close to significance (.27) and 
the correlation with opposites associations 
(—.31) was significant. Partial correlations 
have been determined for all the significant 
relationships between these variables (sig- 
nificantly related to aptitude) and other 
variables, holding aptitude constant. In no 
case did the correlations in question fall 
below significance as a result of this partial- 
ling procedure. 


Interrelations among Opposites Measures: 
Hypothesis I 


The opposites measures to be reported in- 
clude the following: the frequency of oppo- 
sites associations, the bimodality index de- 
rived from sorting statements about the 
outgroup, and a number of indices based 
on Semantic Differential ratings of 24 
concepts. 

Semantic differential measures. The first 
step involved analyzing the interscale cor- 
relations for each of 24 concepts. That is 
to say, the ratings of a given concept on 
each SD scale were correlated with ratings 
on every other one of the 12 scales. There 
were consequently 66 correlations for each 
of the 24 concepts. The results confirmed 
the close relations among the different eval- 
uative scales? already combined as measures 
of attitude and also demonstrated consist- 
ently high correlations between three other 
scales for each of the 24 concepts. These 
were the scales designated by the terms: 
LARGE-SMALL, WIDE-NARROW, апі DEEP- 
SHALLOW, which became the basis of potency 


10 Aptitude had a correlation of —16 (men) and 
—01 (women) with reversal and —.09 (men) and 
—07 (women) with projection. None of these 
correlations is significant. 
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ratings. Two kinds of indices were derived 
from these SD scores: deviation scores and 
difference scores. 


a. Deviation scores. The potency and 
evaluation scores of each person were con- 
verted to deviation scores, the distance from 
the midpoint of the scale to the point of 
rating without regard to sign. These devia- 
tions on the four evaluative scales were 
summed for each concept to produce the 
evaluative deviation score for that concept ; 
deviations on the potency scales summed 
over scales constituted the potency devia- 
tion scores. The potency deviation scores 
and evaluation deviation scores were then 
correlated over 19 concepts (excluding in- 
group and outgroup concepts). Table 1 
presents the proportion of 171 intercorrela- 
tions of potency scores that were positive 
and that were significant in a positive 
direction; the proportion of the 171 inter- 
correlations of evaluation scores and of the 
361 potency and evaluation correlations (19 
concepts each with an evaluation and a 
potency score) that fell into these classes. 

It will be seen that 100% of the inter- 
correlations of potency deviations were 
positive and 92% and 82% of potency cor- 
relations were significantly positive for men 
and women, respectively. The correlations 
were somewhat smaller among evaluation 


TABLE 1 


PERCENTAGE oF POSITIVE AND SIGNIFICANT CORRE- 
LATIONS BETWEEN DEVIATION Scores or 19 
Concepts on PorENCY AND EVALUATION 


SCALES 
Evaluation Potency | Evaluation 
Correla- | Correla- and 
tions tions Potency 
M 98.2%| 100% 94.4%, 
Positive 
W 95.9 100 94.4 
Signifi- M 69.5 92.0 31.0 
cantly 
Positive* | үү 53.8 82.4 42.9 
Number r’s | 171 171 361 
Total N —703 
“рчы” ЖК” Чыршы ү == 


scores but they were still highly significant 
in the positive direction. Even when po- 
tency deviations were correlated with evalu- 
ation, the proportion of positive correlations 
was far in excess of chance. 

These results support Hypothesis I for 
this set of measures by showing that differ- 
ent concepts rated on the same and dif- 
ferent scales have a tendency to deviate m 
the same degree and in the same direction. 


In Tables 2 and 3 deviation scores have 
produced six independent indices: (a 
average deviations on evaluative scales оя 
concepts that are clearly opposites (Еч): 
DIRTY, CLEAN, НОХЕЅТҮ, DECEIT, TERE 
WASTEFUL; (b) average deviations on | 
uative scales of concepts that do not nay 
clear opposites in the list (Еуһор): и 
ENEMY, EARLY, SERVANT, MASTER; (С) ахои 
age deviations on potency scales of conceP 
in (a) referred to as Ро; (d) ауа 
potency deviations of the concepts in ‘on 
referred to as Рош; (е) average deviate 
on evaluative scales of ingroup concep ê 
and (f) average deviation on evaluat! 
scales of outgroup concepts. ht 

b. Difference scores. It was thoug 
Possible that distance between rating? Ў 
pairs of opposite concepts might be ап in 
propriate measure of opposites strict 5 
and a general average difference score tive 
calculated, the mean difference in eval Е 
ratings of six pairs of opposite conc 
without regard to sign (Av Diff G)- f 

In addition, four other difference E 
have been employed: (a) Average : 08 
спсе on evaluative scales between "n 
Opposite concepts that might be though ma 
as having an anal reference С 
WASTEFULNESS, HONEsTy ppcEIT, СІ if 
DIRTY). This score is called Ev,. (b) 15 
ference on evaluative scales between 0С 
of nonanal opposites (SAFETY-DANGER: 
CESS-FAILURE, DOMINANCE-SUBMISS 00 
These are labelled Ev,,. (с) Different? g), 
Potency scales for the concepts И cy 
called Po, (d) Difference on poro 


scales for the concepts named in (b) 2 
called Pona. 


;; dif; 
c. Interrelations among seman 
ferential indices, Although the res" 


PR — 
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Table 1 suggest a tendency to rate concepts 
of various kinds similarly on different kinds 
of scales, it was important to examine pos- 
sible difference in degree of relationship 
when different classes of concepts were 
rated and when concepts were rated on 
evaluative or potency scales. 

When average correlations of difference 
Scores were compared, they were approxi- 
mately in this order: highest, different con- 
cepts rated on the same evaluation scales 
(r — .62* and .53* for men and women, 
respectively) ; next highest were correla- 
tions of the same concepts rated on potency 
and on evaluation scales (i.e., on different 
scales, 7's = .28* and .32*, respectively, for 
men and women); then, different concepts 
on the same potency scales (7's = .31* and 
.01) ; finally, different concepts on different 
scales (r = .19 and —.01, .16 and 07). 

The results were similar for deviation 
scores. When scales were the same, mean 
correlations were highest even though the 
concepts rated were different (r — .67* and 
.68* in men and in women for evaluation 
scales, and .74* and .69* for potency) ; next 
were the correlations between indices based 
on the same concepts rated on, different 
scales. (.40* and .66* when the concepts 
were opposites and .60* and .65* when they 
were nonopposites) ; finally, the lowest cor- 
relations appeared when both concepts and 
scales were different (.46* and .40* be- 
tween potency nonopposites and evaluation 
opposites and .33* and .61* between potency 
opposites and evaluation nonopposites). In 
all these correlations the number of cases 
was equal to 52 for men and 51 for women, 
and the correlations which were significant 
are starred. Correlations were converted to 
z scores before averaging. 

Opposites associations. There was еп» 
dence of a positive relation between fre- 

‚ of opposites associations when dif- 
quency oi opp d Th 
ferent stimulus words were used. e 
number of opposite responses to 100 Kent- 
Rosanoff words was correlated with the 

i onses to 25 other 
number of opposite resp 
word stimuli. For both men and women 
this relation. Was positive and significant 
(р < .001). When the relation of pq 
to words given on the first day and on the 


second day (one week later) was deter- 
mined, the chi squares were again signifi- 
cant and even higher. This makes it appear 
that a transient set to respond with oppo- 
sites on some one occasion was not respon- 
sible for the correlation. 

Bimodality $0715. It should be mentioned 
at this time that an earlier study (Peak, 
1959a) has demonstrated a significant rela- 
tion in both men and women between the 
bimodality scores derived from sorting two 
separate sets of items, one set referring to 
Negroes and the other set to the members 
of the ingroup. Correlations between these 
bimodality scores were .57 and .72 for men 
and women, respectively, both coefficients 
being significant at the .01 level. 

Interrelations of opposites associations, 
semantic differential indices, and bimodality 
indices. Table 2 presents the correlations of 
each of the indices based on Semantic 
Differential ratings with the outgroup bi- 
modality index and with the frequency of 
opposites associations. Since the Semantic 
Differential deviation scores and difference 
scores were derived from overlapping data, 
one or the other must be counted in sum- 
ming up the relations. 

In men, 12 of the 13 correlations between 
Semantic Differential deviation indices and 
the bimodality index, between the SD devi- 
ation indices and opposites associations, and 
between opposites and bimodality were posi- 
tive; 5 of the 13 were significant relations. 
For women 8 of the 13 were positive, none 
significant. If in this group of indices only 
those Semantic Differential indices were 
counted which involved evaluation scales, 9 
of the 9 opposite indices were positively 
correlated in men; 5 of these were signifi- 
cant. In women 6 of the 9 were positive, 
none significant. 

When the Semantic Differential differ- 
ence indices were counted along with the 
other opposites indices (bimodality and op- 
posites associations), then 10 out of the 11 
correlations were positive for men; 3 of 
these were significant. In women 8 of 11 
were positive, none significant. Once again 
when potency scores were omitted, 7 out of 
7 were positive in men, 3 being significant. 
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Six out of 7 were positive in women, none 
significant. 

In summary, Hypothesis I was supported 
clearly by the results from the male sample: 
а large proportion of the relations among 
Semantic Differential ratings were signifi- 
cantly positive; relations between bimodality 
indices were significantly positive, as were 
relations between frequency of opposite re- 
actions to different lists of words even 
when tested on different days. Thirty-eight 
per cent of the relations among bimodality 
indices, opposites associations, and Semantic 
Differential indices were significantly posi- 
tive at the .05 level when SD deviation 


indices were employed. In women Hypoth 
esis I was supported strongly by the corre 
lations among Semantic Differential meas- 
ures as well as between the two bimodality 
sorts and between the two measures of op- 
posites associations. There were no signifi- 
cant positive correlations between the thig 
types of measures (opposites, bimodality; 
and SD) for women. For both men an 

women, correlations between measures be 
came smaller when measuring methods di | 
fered, when concepts were different, V 
when different scales were used. Opposite? 
association measures were less clearly 16 
lated to Semantic Differential measures a? 


TABLE 2 
GENERALITY OF OPPOSITES TENDENCIES-CORRELATIONS (PEARSON) 
Bimodality- M .18 (50) 
Opposites Assoc Ww .20 (47)ь 
H H З 
Bimodality— M .32* Opp Assoc- M a m^ 46" 
Outgroup Dev Ww 42 Outgroup Dev W —.14 
Bimodality- M 45*» O 
рр Assoc- M .02 
Ingroup Dev w .02 Ingroup Dev м . 04 p 
Bimodality— M 36* [9] | 
[ к рр Аззос- М .10 
Еур Dev WwW -.01 Evop Dev W .20 
Bimodality— M 29* 
E Opp Assoc- 
EvNoP Dev W 08 ы ad XE - M 
Bimodality— 
Bow Des ы H Opp Assoc- M 15 
E Poop Dev үү .08 
Bimodality— M 
Pose» Dey W Е EC Opp Assoc- М —.01 
= Poxop Dev W —.01 
imodality- 
Av Diff G -o Sg Opp. Assoc- m 
№ .03 Ау Diff М Е 
Bimodali i E i 
1modality- M .37* 
Diff Eva W .01 me e M 16 
Ls ME EVA М 21 
imodality- M .20 
Diff Evx W -117 Moose и >Ш 
а 1 УМА үү 12 
imodality— M .08 
Diff Po, w Е Озе Assoc~ М —.16 
> iff Poa № —.08 
imodality- M 
Diff Pona Ww b Opp Assoc M .02 
Diff Pona WwW .06 
iN pees 52 ехсере for opposites assoc, № = 50, 


АЕ except for opposites assoc, № = 47. 


sp 
** p <.01. 
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to bimodality measures than the latter two 
were to each other. Moreover, bimodality 
indices were less clearly related to potency 
measures than to evaluation. 


Relationships among Defense Preferences: 
Hypothesis П 


Hypothesis И was supported by the fact 
that there was a significant negative rela- 
tion between preference for projection and 
preference for reaction formation Or 
reversal. 

Because DPI scores were derived from 
five statements made after the 
presentation of each Blacky picture, each 
statement representing preference for one 
of the five defenses, а spurious negative cor- 
relation would be expected among the 
scores. lt was necessary, therefore, to find 
a way of removing this correlation in order 
to determine the actual relation among DPI 
preferences." First, the average rank of 
projection statements and the average rank 
of those reflecting reversal preference were 
determined for each person and a scatter 
plot was made of the relation of the two 
variables, projection and reversal. Since 
five ranks were possible this resulted in a 
5 by 5 table." This table was then reduced 
aking one variable 


ranks of 


to 4 by 5 categories by m 
(viz., reversal) assume only four values. 
Thus, if a person's average projection score 
was highest (Rank 1) and reversal second, 
in the reduced table he retained Rank 1 on 
projection but was also given Rank 1 on 
reversal; if he had Rank 1 on projection 
and 4 on reversal, projection remained un- 
changed but reversal was converted to 
Rank 3: if projection was Rank 2 and re- 
versal 1, these two ranks were not changed, 
but reversal Ranks 3, 4, and 5 became 2, 3, 
and 4, respectively. And so on. In other 
words, the "reduced" variable took a rank 


“Та Join M. Gilbert of the Center for Advanced 
Study in the Behavioral Sciences suggested this 
method of handling the problem. We are also 
indebted to him for help in programming Our data 
for computer runs. . | 

12 Tied average ranks were assigned to one of 
the adjacent categories by a table of random 


numbers. 


of 1, 2, 3, or 4, columns or rows keeping 
their original order but having ranks inde- 
pendent of those taken by the first variable. 
A second 4 by 5 table was built from the 
same data by collapsing the other variable, 
projection, into four categories in the same 
fashion, while reversal retained the original 
five categories. 

These four by five tables were analyzed 
by the Mood maximum likelihood chi 
square method. For men the chi squares 
were significant at less than the .05 level 
{ог both tables relating projection and re- 
versal ; for women the corresponding proba- 
bilities were < .02 and < .01. In order to 
demonstrate the direction of this relation- 
ship the categories were further collapsed 
to two by two tables. Men and women were 
combined since the relation between the 
variables was the same. Phi coefficients for 
the two tables were —.44 and —.38.1° The 
chi squares from which these coefficients 
were derived were significant at less than 
the .001 level. 

Avoidance and reaction formation were 
positively related, the chi square being sig- 
nificant for women but not for men. Prefer- 
ence for projection was positively related 
to preference for regression (р < .05 for 
men and « .10 for women.) 


Hostility and Defenses 


No specific predictions were made re- 
garding the relation of defense preference 
and hostility, but it is of interest that projec- 
tion was positively related to the number of 
hostility items appearing in the stories (7's 
= 34 and .32 in men and women, respec- 
tively ; significant at the 05 level). Rever- 
sal was, on the other hand, negatively cor- 
related with the number of hostility items 
(r's = —.37 and —.36, both significant). 

It will be explained in the discussion that 
evidence of several kinds led to an hypoth- 
esis that reversal tended to be associated 
with restriction of psychological distance 
and difference between certain categories 
with consequent emphasis on category simi- 


سے 


13 With the given marginals, maximum phi for 
the first table was 940; for the second, .867. 
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larity. It was hypothesized that this might 
be expected to 


Blacky 


ideas pro- 
duced was negatively related to reversal 
preference (75 = —24 and —31 with р < 
women.) The 


positive and low (r — .06 and 14 for men 
and for women). The question then arose 
whether the association of number of hos- 
tile ideas with defense preferences was 
between total 
Scores. The 
positive relation of projection and percent- 
Significant for 


sal preference dropped when hostility was 
expressed as a percentage of the total out- 
Dut (7's = — 27 with P < .10 and —.17 with 
$ > ЛО for men and women, respectively). 

Table 5 reveals these same differences 
Somewhat more clearly. This Table repre- 
Sents the mean scores of the variables 
within Sroups resulting from dividing the 
sample simultaneously on the basis of pro- 


Jection and 


more productive than were those high on 
both projection and reversal, but not signifi- 
cantly so. The percentage of hostility items 
was significantly greater in high projection— 
low reversal conditions than in the low- 
high condition (р < 05) and the high-low 
group also produced a greater 
of hostility items than did th 


a closer relation between per cent hostility 
and projection than between per cent hosti]- 
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Women  high-high scorers produced 
fewer ideas than did those with high-low 
Scores (р < .05); high-low scorers ex- 
pressed more than those low on projection 
and high on reversal (р < .01). Results 
for the percentage of hostility items were 
in the same direction as for the male sample 
but the differences between group means 
were not significant for women. 


Defenses and Opposites: Hypothesis III 


The relations between opposites measures 
and defense preference have been analyzed 
by several different methods. There were, 
first of all, the different opposites indices to 
be taken into account, as well as different 
indices of defense preference. In the sec- 
ond place, the relations between defense 
preference and opposites scores have been 
expressed in two ways: in terms of Pearson 
correlation coefficients and by comparing 
the means of Opposites indices in the four 
groups resulting from dividing the sam- 
ple simultaneously on reaction. formation 
Scores and on projection scores. Only the 
comparison of means is reported at this 
time. Correlation coefficients were not 
markedly different in their implications from 
differences in means of the groups, but 
where there were differences the means are 
probably more accurate because the level of 
reversal and projection scores was con- 
trolled for each comparison, 

Tables 3 and 4 present the means for the 
four DPI groups resulting from dividing 
the sample simultaneously оп reversal an 
1 In Table 3, DPI indices 
from hostility pictures were used; in Table 
4, the indices were those derived from Ted 


„Tt was predicted in Hypothesis III tha 
high projection scores would be associate 
with high Opposite scores and high revers? 
Scores with low Opposite scores, Therefor” 
those high on Projection and high on reari 
tion formation should haye higher оров! 
Scores than the low-high DPI group, P" 


| 
| 
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jection varying and reversal constant and 
high. Moreover, high-low scorers should 
have larger opposites scores than low-low 
scorers, indicating the effects of projection 
with reaction formation constant and low. 
On the other hand, those with high projec- 


ІІ 


tion and high reversal preference should 
have lower opposites scores than the high- 
low group, and the low-high group should 
have smaller scores than the low-low 5$, 
both of these comparisons indicating the 
effects of reversal preference with projec- 


TABLE 3 


Opposite Scores or DPI Groups 
(Hostility Pictures) 


igh Proj- High Proj- Low Proj- Low Proj- Significance 
E R oy R High К LowR and 
I II III IV Direction 
of 
Mean 5 Меап 5 Меап 5 Mean 5 Difference 
i i 5 7.2 56 | 14.9 7.0 | 14.5 3.1 
Eu. M a 79 15$ са | H5. 44 | 112 90. ВЫ 
е. ны ге) 
26.3" * Sate || 22:91 OT туя 
0 27.3 13.9 | 21.1 8.8 | 
TUAM x 208 11.6 | 22.6 13.6 | 23.4 12. 26.5 10.5 
7 25.9 14.5 | I«II* 
7.9 | 30.7 12.3 | 27.5 16-1 к 
Do, А n4 мз | 359 12.3, | 208, IO SM р 7.6 
| aset 43-9 al es 
juo | and ао: 1 
Eis В 2 iS О ои ШШ ОЛ ырш 
Ae 10.1 | 44.2 BO | ек 
9 ins | 48.0. 6-3 | #8 
BE F 22 ex il sis а сао ого 48.2. ИЛ 
8.4 | 30.3 8.4 | I«II** 
7.5 | 32.4 5.8 | 28.1 
oni ¥ 03 Се И ES MEE e ще 5.5 
29.9 6.5 | II>IV*** 
Ў FEN CN UL зе А 
De л 265 Of | ses Ми ue 
Pie | pe e en 
Ponor 199 6.4 | 11.1 6.5 20.0 3.6 
Dev" tee а к ШЕ ane) ae BE 
ml NN Dem - 
Evy: „7 2 20.2 5.3 20.9 4.2 
T F s He 2:6 7 | 24 FI 2327 6.4 
29.9 6.5 | 28.3 6.8 | 28.9 6.0 
24 М $45 is 52 63 | 207 7.4 | 30.8 6.7 
M : 2.4 5.8 3.0 6.0 1.5 5:84 ЭМ 
ur F 43 222 6:5 13: ik 271 8.2 1.9 | I«IV**s 
Т<Ш** 
Р м | 2:0 951 9.6 2.2 322 126 35 17 
d E 4297 10-| x9 1.6 | 4$ 18| 28 aves | Pme 
" 7 16 17 10 
y ү 6 17 20 6 
bs .10. 
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tion held constant. Finally, the group with 
high projection scores and low reversal 
Scores should have higher Opposite scores 
than the one with low-high scores, both 
defense preferences varying in opposite 
directions. 

The relative size of the Opposites scores 
of the above pairs has been determined for 
men and for women, for each opposites 
measure. The number of differences in the 
predicted direction has been counted for 
two sets of comparisons, the first using 
Semantic Differential indices derived from 
deviation scores, bimodality Scores, and op- 
posites associations, and the second includ- 
ing the Semantic Differential difference 
Scores along with bimodality and opposites 
associations. These counts indicate the pro- 
portion of the comparisons of mean орро- 
groups with dif- 


In the section following, DPI indices based 
on hostility pictures Were used to determine 
groups; in the next Section after that, in- 
dices derived from conflict pictures were 
employed, 

Groups determined by hostility DP['s. 
‘In the first count using deviation scores, 
bimodality, Opposites associations 
women showed no Significant predicted dif- 
Opposite scores of the 


Was not a significant 


umbe es in the predicted 
direction, For the comparison of the high 


projection-low reversal group with the low 
projection-high reversa] group seven of 
eight differences in means were as pre- 
dicted (р = 035, one-tailed), 

For the male s 


varying. Twelve of the 16 comparisons be- 
tween high and low reaction formation 
means with projection constant were in the 
predicted direction (p = .038, one-tailed). 
Nine of 16 were in the predicted direction 
with projection varying and reversal con- 
stant. Comparing the high projection-low 
reversal means with low-high means, seven 
of eight were as predicted (р = .035, опе- 
tailed). 4 
On the second count, including difference 
scores for measures involving Semantic 
Differential ratings, plus bimodality and un 
posites associations, the results were in 
Same direction but less clear. Only when 
the average difference score was the meas 
ure was there a significant predicted differ 
ence in a Semantic Differential measure n 
men. One such measure was significant =. 
women, Pona. In men, 6 of 12 comparison? 
were in the predicted direction with projec 
tion varying; 9 of 12 with reversal varying 
(р = .073, one-tailed). In women, попе В 
the frequencies exceeded chance expecta 
tion, when difference scores were involve ^ 
Groups determined by conflict DPI A 
When the sample was divided on the Pai 
of ОРІ” calculated only from the pien 
on which an 5 showed evidences of conflic" 
then the results changed in two bo. 
(Table 4). The relations of DPI scores t 
the opposites measures became ѕотемћа 
more significant and the results for те! 
and women were more alike in that vw 
was no difference in the magnitude of FO 
effect of projection and reversal on oppos! 8 
Scores for men and women. The result 
for men and women have been combine" 
therefore, in testing the frequencies of rcl 
tions in predicted directions, it 
sing the deviation measures, bimodali У 
scores and opposites association scores: ^ 
of 32 comparisons were as predicted wh 
Projection was varying and reversal xg 
(P < 02) ; 24 of 32 were as predic 
When reversa] was varying and project d 


€ n 
held constant (р < 01). When men 038 
women were 


treated separately, р = 
and .105 with Projection varying; wit! леп 
Versal varying, P = .011 and .105 for zo 
and women, respectively, In 7 of 8 © 8 
parisons (5 — .035) for men and 6 ° 


h re 
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(p = .145) for women, high projection-low 
reversal means were greater than low pro- 
jection-high reversal means. 

With difference scores replacing devia- 
tion scores the result were in the same direc- 
tion, but the frequency of the differences in 
the predicted direction did not exceed 
chance. 
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In women there were four significant dif- 
ferences in the predicted direction out of 16 
comparisons: bimodality scores were larger 
for the high-projection low reversal group 
than for the high-high group, scores for 
the high-low group were greater than for 
low-lows and scores for the high-low 
group were greater than for the low-high 


жр <.01. 


TABLE 4 
Opposite Scores or DPI GROUPS 
(Conflict Pictures) 
High Proj- High Proj- | Low Proj- | Low Proj- 
i i Low R 
dd B E Е в: ту Significance and 
L Direction of Differences 
Mean 5 Mean 5 Mean 5 Mean s 
i i 6.2 6.0| 15.5 6.5|15.7 5.2 
аы: а 1% 16:2 60|1459 6.0 | 14.0 4.2 | I«IIs П>; ИТУ 
i 24.2 12.5 | 27.2 10.9 | 20.4 10.9 
oom у xs M 23`6 14.6 | 20.8 12.9 | 22.2 15.4 | I2 IIs >Ш 
| "n 28.6 16.9 
0G м | 95.0 11.5 | 30.4 44.8 | 24-1 142 | 28- 
Dev Е | 203 11.0 | 35.6 9.6 | 31.9 14.9 | 31.4 9.3 
раша 95 [344 1:6 00:6 ee 
Dev EIN озу их 485 $5995 4 I«IV* 
2 [ааа 7.8 | 38.9 8.1 | 47.9 8.9 | >II >I; HI<IV= 
Br М | 46.5 91| 50.0 7.8| 48.5 10.1|49.0 8.9 
" alai 4s|s8 64| 253 86| М1 7.5 тее ШШ II eIVe 
DN M | 38.5 gis 33.1 7.5 | 33.0 8.7 | 33-2 6.3 
Poo м | 281 8.4 | 28.9 8.1|21.9 8.1 | 21.5 7.7 | IIIS HI < И***; IIS IV 
Dev lG 85| 29.8 9.3|30.9 8.7 31.0 8.3 
м |187 5.2|20.9 $.4|16.6 6.3 | 17.2 5.1 | И>Ш*+ 
P Y |230 65| 20.4 7.5 | 22.7 6.5 | 23.1 4.5 
— — Tlu|a.6 as|zi 67| 17.2 59|252 31 |тошепэшееш AIV 
Eus M 222. $239. 61| 24.9 7.0 | 27.4 17.2 | 1<1У* 
—— "lum. 54|304 121252 61300 63 I III; П>Ш** 
€ м ists 59|326 6.2] 31.1 8.8 |311 5.5 
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group; the number of opposite associations 
in high-high scores was greater than in the 
low-high group. In men, 12 of 16 com- 
parisons were significant in the predicted 
direction when deviation indices were in- 
volved and 6 of 12 with difference scores. 

It must be concluded that Hypothesis IIT, 
though supported in part, needs certain 
qualifications. It is necessary to sample 
more systematically the content areas 
tapped by the various opposites measures 
described in this section in order to deter- 
mine what exactly was responsible for the 
higher association of the opposites series 
with defense preferences expressed in areas 
of conflict than in response to hostility pic- 
tures. It appears likely that preference for 
reversal is associated with low opposites 
Scores and projection with high ones when 
preferences are determined in situations 
comparable to those involved in the oppo- 
Sites measures. This is the plausible ex- 
planation of the closer relation between 
ingroup-outgroup difference and DPI 
scores based on hostility pictures; both the 
attitude measures and hostility pictures pre- 
sumably produced reactions to hostility. It 
can only be surmised that the general oppo- 
sites measures of Tables 3 and 4 activated 
categories that overlapped with conflict pic- 
tures to a greater degree than with hostility 
pictures. 

A second qualification is one that must be 
stated for almost every hypothesis in this 
study. Hypothesis ПТ was more strongly 
supported for men than for women, though 
tendencies were in the same direction for 
both. 

Finally, a comment should be made on 
the more significant relations obtained with 
SD deviation scores than with SD differ- 
ence scores. Since the latter were derived 
from differences in Semantic Differential 
ratings of pairs of concepts assumed to be 
opposites, a tendency to extreme reactions 
to both concepts would be obscured if Ss 
did not place the two concepts on opposite 
sides of the scales. This tended to be the 
case, for example, for the pair SAFETY- 
DANGER. Men, in particular, did not always 
tate DANGER as bad, in which case the dif- 
ference between the DANGER and SAFETY 


score tended to be small, even when both 
scores were extreme. This suggests the 
need for further attention to the nature of 
the structures being analyzed. 


Defense Preference and Attitude: 
Hypotheses IV and V 


Here, as in other sections, relationships 
have been analyzed by correlating the vari- 
ables and by comparing the means of рог- 
tions of the sample which have known levels 
of preference for reversal and projection. 
The latter analysis is reported at this time. 

Table 5 presents means of the important 
variables for each of the following groups 
of Ss: high projection-high reversal, high 
projection-low reversal, low projection- 
high reversal, and low on both dei 
Figure 1 indicates the means of the attitude 
variables in men and women for each 0 
these DPI groups. The horizontal lines 
represent evaluative scales with the mean 
ingroup (family, friends, Americans), МЕ, 
and outgroup positions (Negro neighbors; 
Negro roommate, foreigner) for each D 
group. The attitude positions of different 
groups are joined by solid lines for тей 
and by dotted lines for women. : 

The significant differences in these vari 
ables will be indicated in the succeeding 
paragraphs and their implications discuss! 
in the final section. 

Ingroup attitudes. Only one comparison 
of ingroup attitudes in the different DP 
groups reached an acceptable level of 518” 
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nificance: in women with reversal constant 
and high, there were more favorable atti- 
tudes toward the ingroup when projection 
was high than when it was low (р < 05), 
suggesting the predicted effect of projection 
on ingroup attitude. In men with projection 
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constant and high, there were more favor- 
able attitudes toward the ingroup when re- 
versal preference was low than when it was 
high (р < .10), which implies that reversal 
was making the ingroup attitude less favor- 
able. 


TABLE 5 


AVERAGES OF VARIABLES АЗ FUNCTION OF RE 


VERSAL AND PROJECTION 


(Hostility Pictures) 


igh Proj- | High Proj- Low Рго]- Low Proj- у 
E R tow В High R Low К P's | 
I II ш IV between Variables 
Indicated 
Mean 5 Меап 5 Mean 5 Mean s 
1 0| 110.5 15.2 | I«II* 
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Attitude w 26.1 vu ui ps 25 21.5 | 30.5 21.5 | III; IL» III; IT V5 
26.7 16. 5.9 26. 
Aptitude | M | 120.7 14.0 | 125.7 n ind ^u 134.0 177 
c ita ux DT d 121.3 20.7 | 133.1 17.8 | 1<1уь; <" 
7. ^а | 124. 
© Hostility M a5. 2.0 4.8 1% * P is ү II» IV*; II» II^ 
Ideas/Story| W әй ал 3.6 1.3 3. А 24 i: 
21.9 4.2| 24.4 2.0 | I<IV4; III CIV 
нЕ /Story т qum : ni 10 24.4 4.1| 28.0 3.9 I«II*; IL» Ша; IN <IVe 
eas, . 2 . ` 
7 7 II» 
я 6 9.0 67.3 10.0| 64.1 16.7 129 
Еве |M| ROPA D $ | 605 10:5 | 66.2 13.1 | III I«IIIs; П> 
үү | 52.5 9.8| 67.9 10. eet 
age / 7 16 17 10 
No. Cases E Б 19 20 | 6 | 
d 3 35 37 | 16 | 
C 1 | | | 
ар<.10. 
b p <.05. 
e p <.02. 
4р<.01. 
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Outgroup attitudes. In both men and 
women with projection constant and high, 
outgroup attitudes were significantly more 
favorable in those with high reversal pref- 
erence than in those with low (p = 05), 
which was predicted. In men with reversal 
constant and low, those with high prefer- 
ence for projection had a significantly less 
favorable attitude toward the outgroup than 
did those with low projection scores (p< 
01). The difference was in the same direc- 
tion for women but did not reach ап ас- 
ceptable level of significance. The high 
projection-low reversal group had signifi- 
cantly less favorable outgroup attitudes 
than did the low projection-high reversal 
group of men (р < 05). Again, a differ- 
ence in the same direction for women was 
not significant. These differences were in 
the predicted direction according to Hy- 
potheses IV and V. 

There were no significant differences in 
attitude toward the self-concept (ME) as a 
function of preference for reversal or pro- 
jection, though in both men and women 
high-high scorers were less favorable to ME 
than high-low scorers (р < .10 for 
women) ; that is, with projection constant 
and high there tended to be more favorable 
self-attitudes associated with low reversal 
than with high reversal preference. 

Ingroup-outgroup attitude differences. 
When outgroup attitude ratings were sub- 
tracted from ingroup ratings for each 5 and 
the differences averaged within each DPI 
group without regard to sign, both men and 
women showed maximum difference in the 
group with high preference for projection 
and low preference for reversal and less 
difference for the high reversal groups. 

Scores of men and women have not been 
combined in some analyses because of sig- 
nificant differences in means of the two 
samples. However, differences between 
men and women were not significant for 
the mean ingroup-outgroup attitude differ- 
ences, for mean ingroup-ME attitude differ- 
ences, or for mean outgroup-ME attitude 
differences, as indicated in Table 6. There- 
fore, the two samples have been combined 


in analyzing these difference Scores (C in 
Table 5). 


The association of small ingroup-out- 
group differences with reversal preference 
was demonstrated in the significant differ- 
ence between the high reversal-high projec- 
tion group and the low reversal-high pro- 
jection group (р < .01). However, there 
was not a significant difference in the high 
reversal-low projection and low-reversal 
low projection groups. In short, the effects 
of reversal preference were clear only with 
projection preference high and constant. 
The comparison of the high projection-low 
reversal group and the low projection-high 
reversal group showed a significant differ- 
ence in the expected direction (р < .02), а 
difference which cannot be attributed ex- 
clusively to reversal or to projection. The 
significant difference in the high projection- 
low reversal and low projection-low H 
versal groups (p < .05) suggests that with 
reversal constant and low, high preference 
for projection favors larger ingroup-out- 
group differences. On the other hand, the 
expected effects of projection preference 
did not appear when reversal was constant 
and high and projection varied. These 1 
sults suggest that when both reaction forma- 
tion and projection were favored defenses: 
the former obscured the effects of the lat- 
ter, so that effects of projection appeare 
only when reversal preference was low. | 

ME-oulgroup differences, The mean dif- 
ferences in this pair of attitudes follow 
the same pattern as the ingroup-outgrouP 
differences, and for the combined sample 
the same comparisons of DPI groups We 
significant except that the comparison ES 
the high projection-low reversal group Wit t 
low projection-Iow reversal group did 9° 
reach a significant level. К 

Ingroup-wg differences. Since y 
Were significant. positive correlations ре, 
tween attitudes toward the concept ME 2 4 
toward the ingroup (Pearson 7s =. T 
-45 for men and women, respectively) i. 
Was not surprising to find that these po 
tudes tended to change together from sw- 
DPI group to another. There were: tal 
ever, two significant differences for the sal 
sample. The low projection-high reve 
group showed greater wr-ingroup di 
than the low-low group (р < .01). 


OPPOSITES STRUCTURES, DEFENSES, AND ATTITUDES 17 


comparison reflects the association of rever- 
sal preference with increased difference in 
self- and ingroup ratings when compared 
to low reversal. No significant effects of 
projection on Mr-ingroup differences were 
revealed in the group comparisons though 
high projection groups also showed some- 
what greater ME-ingroup differences than 
did low projection groups when reversal 
was constant. 

The combination of high reversal and 
high projection preference produced larger 
тетоир-ме differences than did any other 
condition and low preference for both de- 
fenses produced the smallest differences (р 
« .01 between these two groups). In other 
Words, self-attitudes and ingroup attitudes 
were farther apart with combined prefer- 
ence for projection and reversal than with 


preference for neither. 


Analysis of Separate Ingroup and 
Outgroup Concepts 

ere defined for our 

f attitudes to- 

nericans. Out- 

ions to Negro 


Ingroup attitudes w 
purposes as а combination о 
ward family, friends, and An 
group attitudes included react : ‹ 
neighbor, Negro roommate, and foreigners. 
Predictions about the composite indices 
have been tested and reported. It is im- 
portant, however, to examine two further 
questions: (a) Are the results characteristic 
of each component of the ingroup and out- 
group indices? (b) Are there differences 
in the organization of ingroup components 
and of outgroup components as à function 
of defense preference? No predictions 
were made about this second point but it 1s 
clear that intracategory relationships must 
be described їп any complete account of 
structures. For example, Bruner, Good- 
now, and Austin (1956) and Pettigrew 
(1958) have reported individual differences 
in category breadth and others have as- 
sumed that prejudiced persons ignore dis- 
tinctions within the outgroup. The present 
data provide information only about evalu- 
ative ratings of components of ingroup and 
outgroup categories, but it is of interest to 
note relations among these properties even 
though they are incomplete aspects of total 


ingroup-outgroup structure. 


The relation of defense preference to dif- 
ferences т component ingroup-outgroup 
concepts. Mean attitude ratings for each of 
the six concepts listed above as well as 
mean difference in ratings of the concepts 
without regard to sign have been deter- 
mined within each DPI group. 

Relative distances between the attitudes 
involving Negroes (roommate and neigh- 
bors) and each of the ingroup concepts 
within DPI groups were those reported 
for the composite indices. In every one of 
the 12 comparisons (6 for men and 6 for 
women) the greatest Negro-ingroup dis- 
tance was shown in the high projection-low 
reversal group. 

In men the same was true of the three 
comparisons of ratings of FOREIGNER and 
each of the ingroup concepts. However, 
this was not the case for the women Ss who 
did not react to the concept FOREIGNER as 
opposite to the ingroup concepts. In the 
first place, the mean rating of this concept 
was related to DPI scores as were ingroup 
concepts, tending to become more favorable 
in the high projection groups and less 
favorable with high reversal preference. 
And, secondly, women with high reversal 
preference showed the greatest difference 
between FOREIGNER and AMERICAN, FOR- 
EIGNER and FAMILY, and FOREIGNER and 
FRIEND. This was the typical pattern of re- 
lations between ingroup concepts, as will be 
indicated in the next section. 

In short, the a priori assignment of the 
concepts in question to ingroup and out- 
group categories appears to have been justi- 
fied in the case of men. For women atti- 
tudes toward the concept of FOREIGNER 
were evidently not operating like the atti- 
tudes toward the concept of the NEGRO. 

Within-category distances. Table 6 indi- 
cates that differences were smallest between 
FRIEND and FAMILY, NEGRO ROOMMATE and 
NEGRO NEIGHBOR, AMERICAN and FAMILY, 
and AMERICAN and FRIEND, all within-cate- 
gory pairs. The FOREIGNER-NEGRO ROOM- 
MATE and: FOREIGNER-NEGRO NEIGHBOR 
pairs were also thought of as involving 
within-category comparisons, but the dif- 
ferences between those pairs of attitudes 
were slightly larger than the distance be- 
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TABLE 6 


Mean DIFFERENCES IN ATTITUDES 
Total Sample (№ = 101) 


Pair Mean 5 
|Friend-Family| 1.9 2:5 
[Neg Rm-Neg № Si 4.8 
|Amer-Family| 5.3 6.3 
|Amer-Friend| 5.7 5.1 
|For-Amer| 6.2 5.2 
|For-Family| 8.1 6.2 
|For-Friend| 8.3 2.0 
[Neg N-For| 9.7 7.7 
|Neg R-For| 9.7 8:5 
{Neg N-Amer| 12.6 10.0 
[Мек R-Amer| 12.7 10.6 
|Neg N-Family| 15.7 10.0 
|Neg R-Family| 15.8 11.0 
|Neg R-Friend| 16.0 11.0 
[Мек N-Friend| 16.4 10.0 


tween FOREIGNER and AMERICAN, FOREIGNER 
and FRIEND, and FOREIGNER and FAMILY. 
Here again the concept of FOREIGNER was 
clearly marginal as an outgroup concept. 
Nevertheless, distances were in general 
smaller between components of the ingroup 
as originally defined and between com- 
ponents of the outgroup than across these 
Categories, except where the FOREIGNER 
concept was involved. 

When distances between ingroup atti- 
tudes were examined as a function of de- 
fense preferences, the following results 
were obtained. There were no significant 
differences between DPI groups in mean 
FAMILY and FRIEND attitude differences. 
However, the mean attitude difference be- 
tween AMERICAN and FAMILY was signifi- 
cantly larger (р < .01) in the low projec- 
tion-high reversal group than in the high 
projection-low reversal group. Differences 
between AMERICAN and FRIEND attitudes 
were also significantly larger (р < :02) for 
the low projection-high reversal group. 
The mean of the three ingroup differences 
was significantly larger for those preferring 
reversal than for projectors (р < .05). 

The mean difference in attitudes toward 
NEGRO ROOMMATE and NEGRO NEIGHBOR was 
not significantly related to defense prefer- 
ence. However, the NEGRO ROOMMATE- 


FOREIGNER comparison and the NEGRO 
NEIGHBOR-FOREIGNER comparison showed 
relations to DPI that were more like those 
we came to expect in comparisons between 
categories; i.e., attitude differences in the 
first pair were greater for the high projec- 
tion-low reversal Ss than for low-high Ss 
(р < .10), and the high projection-high 
reversal group had greater mean difference 
in these attitudes than did the high-low 
group (р < .05). Mean attitude differ- 
ences between NEGRO NEIGHBOR and FOR- 
EIGNER were also significantly different (р 
< .05) between high-lows and low-highs, 
the first group showing the larger mean 
difference. 

In summary, attitudes toward the three 
ingroup concepts behaved in a similar man- 
ner and there was a tendency for high pro- 
jectors to have smaller distances between 
ingroup concepts, while those favoring Te- 
versal had greater ingroup differences. For 
the three outgroup concepts, relations be- 
tween NEGRO and FOREIGNER had the char- 
acteristics of between-category relations 
such as have been reported for ingroup- 
outgroup composite indices. High projec- 
tion preference was related to greater dif- 
ferences and reversal to smaller differences: 
Since NEGRO ROOMMATE-NEGRO NEIGHBOR 
differences showed no relation to D 
scores, no generalization can be made about 


mean outgroup attitude differences a5 2 
function of DPI scores. 


Е Scores, Opposites, and 
Defense Preference 


Scores on the California F Scale we? 
not significantly related to any of the ge 
eral opposites measures, They were, how 
ever, significantly related to a difference и. 
ingroup and outgroup attitudes in both m 
and women (р < .05 for men and < 
for women). oh 

High F scores were associated with Eo 
preference for projection and with ill 
preference for reversal. In Table 5 it 


«oh 
be seen that the mean Е score of the PE. 
projection-low reversal group was hi 

an 


than that of any of the other groups 


"NE : he 
was significantly higher in women tha? t 
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low projection-high reversal group (prs 
.05) or than the high projection-high re- 
versal group (р < .01) which had the low- 
est F score in both men and women. In 
men the high-low group had higher F 
scores than did the high-high group (p< 
10). This negative relation of F to rever- 
sal would not have been expected from 
theories of authoritarian personality struc- 
ture, which have assumed a tendency to 
reversal preference as characteristic of the 
high F individual (Adorno et al, 1950). 

Moreover, the fact that the combination 
of high projection with high reversal was 
associated with the lowest Е scores in both 
men and women suggests as already noted 
that reversal may be a defense against the 
tendency toward projection in this group. 
The effects of reversal seem to have been 
dominant here as they were in determining 
ingroup-outgroup distance. 

In other respects the F score results were 
the familiar ones. High ЁЗ had more 
favorable ingroup and less favorable out- 
group attitudes as well as a greater distance 


between these attitudes. The higher the F 
score, the smaller was the шетоир-ме dis- 
tance and the greater the mME-outgroup 
distance. 


Sex Differences: Hypothesis VI 


The primary reason for analyzing the 
difference in reactions of men and women 
was to determine whether the samples could 
be combined. There was a sufficient number 
of significant differences to suggest that 
separate analyses should be made. As al- 
ready indicated, in those cases where the 
differences were not significant, the samples 
were pooled. 

Table 7 compares the average scores of 
men and women on the principal variables 
in the study. The following differences 
were significant at the .05 level or less. In- 
group, outgroup, and selí-attitudes were 
more positive in women than in men, Of 13 
opposites indices all but one showed women 
to have larger opposites scores than men 
and for that exception (opposites associa- 


TABLE 7 
Mean Scores MEN AND WOMEN 
Men Women 
D 
Difference 
Mean E N Mean 5 N 
Outgro TT 75.8 21.9 52 89.0 23.9 51 <.01 
ее 112.8 12.5 52 121.5 8.7 51 <.001 
ME Айна 98.4 18:9 | 52 | 105.6 15.6 | 51 | <.05 
[Ingroup-Outgroup| 2/5 MM. | 52 зал 25.4 | Si | сш 
|ME-Outgroup| P 28.9 20.8 52 25.5 22.1 51 >.10 
[Ingroup-ME| 18.7 13.2 52 18.3 12.2 Su ЕП) 
Bimodality 12 5.6 | 52 6.1 5.8. | 5E | 5.001 
[Ау Dif] 25.9 6.7 52 29.1 5.7 51 «.01 
Opposites Assoc 32.6 13.1 50 32.6 14.7 47 | >.10 
Deviation Oltteroup 27.0 13.6 52 32.4 115 51 | «.05 
Deviation Ingroup 42.3 11.5 52 49.9 8.3 51 <.001 
©, Hostile Ideas 4.3 1.6 51 3.4 1.4 51 «.01 
No. Ideas/Story 22.1 4.9 51 26.2 4.7 51 <.001 
pc 65.9 12.8 52 63.0 11.5 51 >.10 
Reversal” 16.5 3.3 51 14.5 3.8 51 «.01 
Projection” 14.5 3.9 50 16.0 4.1 51 <.10 
Aptitude 125.0 18.9 48 122.5 19.8 49 >.10 


a High score, favorable attitude. 


^ High score, low preferen 


Divide by 12 to determine average scale position on 11 point scale. 
nce. Divide by 5 to determine average rank, These scores are derived from the Hostility pictures. 
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tions) men and women had equal scores.** 
On 10 of the 13 opposites measures the 
women's scores were significantly larger 
than the men's at the .05 level. 

А similar tendency to extreme reactions 
in women has been reported by Osgood, 
Suci, and Tannenbaum (1957, p. 234) and 
by Shapiro and Tagiuri (1959). It is to be 
noted, however, that the average distances 
between ingroup and outgroup, ME and out- 
group, ME and ingroup were not signifi- 
cantly different in men and women. 

Men showed a higher percentage of hos- 

tility items (р < .01) ; women produced а 
greater number of ideas on the Blacky 
stories (р < .001). Men gave higher pref- 
erence ratings to projection than did 
women (р < .05, one-tailed predicted) and 
women were higher on reaction formation 
(P < .01). This last item supports Hy- 
pothesis VI. 
, Perhaps the most striking sex difference 
in this study is the fact that relations be- 
tween variables were in general much 
Stronger in men than in women, though in 
practically all cases results were in the same 
direction. Only relations of F scores to the 
other variables were clearer in women. But 
again, the results were uniformly in the 
same direction for men and women. 

It is not possible to be sure of the reason 
for this result but two hypotheses suggest 
themselves, First, it may be that the DPI 
measures were more valid for men than for 
women. It has been suggested, for ex- 
ample, that Ss tend to interpret the name 
Blacky, the central figure of the DPI test 
of that name, as a male and that this makes 
it more difficult for women to identify with 
Blacky. On the other hand, the F scale was 
first standardized on women and may pos- 
sibly be somewhat better adapted to them. 

In the second place, it may be that 
women have less clearly crystallized and 
organized concepts in the areas which we 
have studied; this would result in more 
independence of reactions to different situa- 


tions. Such a theory is compatible with the 


observation that there were consistent rela- 


14 Only five op 


s posites measures have beenie 
in the Table, but entered 


all were compared, 


tions between opposites measures in women 
only within rather circumscribed areas; 
viz., between Semantic Differential reac- 
tions, between different bimodality scores, 
between opposites scores based on different 
lists responded to on different occasions, 
but very little consistency across these dif- 
ferent opposites situations. 


DISCUSSION 


Characteristics of the defenses of rever- 

sal and projection, which have been ob- 
served in this study, must now be sum- 
marized and interpreted. High preference 
for reversal as a defense was associated 
with small differences between ingroup and 
outgroup attitudes, between ме and out- 
group attitudes, and under certain condi- 
tions with low opposites scores. There was 
also a positive relation between this defense 
preference and favorable outgroup atti- 
tudes, less favorable self-attitudes, 10% 
projection scores, low Е scores, low fre- 
quency of hostility items, and of total out- 
put of items in the stories written about the 
Blacky pictures. The difference in self- and 
ingroup attitudes was significantly greater 
when reversal preference was high and pro- 
jection constant and low. Attitude distances 
between the three components of the in- 
group concepts (FAMILY, FRIENDS, AMER- 
ICAN) were larger in the low projection 
high reversal group than in the high 
projection-low reversal group. 
, On the other hand, a preference for pro" 
jection tended to be related to greater dif- 
ferences between ingroup and outgroup at- 
titudes, to more favorable ingroup attitude? 
to less favorable outgroup attitudes, t? & 
high percentage of hostility responses, to 
high F scores, and to high opposites score? 
under certain conditions, Projectors tent e 


; - on 
to rate Jngroup concepts closer together р 
evaluative scales, 


Styles of Dealing with 
Psychological Distance 


An effort has been made to incorpora 
these results into а general theory in уыс 
Psychological distance has been employe 
аз а central construct in dealing with the 
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problems of motivation (Peak, 1955), per- 
ception (Peak, 19584), attitude change, 
and generalization (Peak, 1958a, 1958b, 
1958c). 

In the motivation article (Peak, 1955) а 
motive state has been described as an acti- 
vated structure consisting of two or more 
related events which are (a) close enough 
together in psychological space, by virtue 
of sharing a common dimension or because 
of associative linkage, to result in a rela- 
tively high probability of mutual activation 
of one event by arousal of the other, but 
(b) sufficiently disparate $0 that the two 
events are not reacted to as the same. For 
example, a system including the painful 
thought of the “D” І made on the last quiz 
and the pleasant thought of the grade of 
"A" may have such a relation. As long as 
the appropriate amount of disparity and 
association exists between the concept of 
"D" with its negative affect and the concept 
of *A" with its more positive affect, there 
Will be a tendency for the arousal of one of 
these events to result in the arousal of the 
other one, with resulting activation of overt 
behavior. Moreover, this activity will tend 
to persist until the distance between the 
events is either increased or decreased 
markedly. Thus, anything which reduces 
the disparity between the terms of such a 
structure to the point where they are re- 
acted to as the same will remove the moti- 
vational activity. This could happen, for 
example, if 1 actually received an A on an 
examination. It would also happen a5 a 
result of reacting defensively to the А and 
the D states as though they Were not differ- 
ent. On the other hand, whatever increases 
the distance from the A state to the per- 
ceived state, like making an E on an exami- 
nation, may increase the distance between 
the terms of the system to the point where 
one event, the thought of the grade of E, 
no longer elicits the other, the thought of 
"A" with sufficiently probability to 
maintain activation in the system. Such an 
increase in disparity тау also be pal 
plished defensively and unrealistically y 
changing the perceived position of one or 
both terms of the disparity 1n such a мау 
that their difference 15 exaggerated. 


This, then, implies two basically different 
ways of reacting which may constitute de- 
fensive adjustments. In the one case, the 
distance or difference between two terms of 
a motive structure is reduced, causing 
activity which stems from this motive sys- 
tem to cease. The results of the present 
study suggest that those preferring reversal 
may have developed a style of reacting in 
this way to disparity between ingroup and 
outgroup attitudes. In the other case, psy- 
chological distance is presumed to increase 
in a manner compatible with the behavior 
of projectors in this study. It would be as- 
sumed that these styles of adjusting have 
been learned as methods of reducing dis- 
parity in particular situations and that they 
have become the favored styles of reacting 
where disparity could not be reduced 
directly by reaching goals. . 

When the results were viewed т this 
framework it became apparent that the 
theory had other implications which could 
be checked in our data. We have found re- 
versal to be negatively related to the number 
of hostile items in the Blacky stories and 
projection related positively to those items. 
Tt was not at once apparent how these find- 
ings could be explained in terms of styles 
of dealing with distance. Then, the follow- 
ing hypothesis occurred to us. The number 
of hostility responses would be increased by 
greater total production of ideas; con- 
versely, these scores would be decreased by 
restrictions on output. It also appeared 
reasonable to assume that a student who is 
writing stories for two minutes will usually 
operate under a self-administered set which 
prevents him from repeating the same idea 
within this brief period. Presumably, the 
person who tends to neglect differences will 
regard more ideas as the same. Conse- 
quently, such a person (with reversal pref- 
erences) would have fewer different ideas 
available and would consequently tend to 
produce less material within the two-minute 
period. This would result in lower hostility 
scores. 

As already noted, the emergence of this 
hypothesis, after the data were collected and 
the main analysis completed, led us to count 


the number of “ideas” produced in the 
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course of writing the projective stories and 
the results tended to support the hypothesis 
by revealing a smaller number of ideas 
produced by those preferring reversal (р 
< .10 for men and р < .05 for women). 
The effect of projection on the number of 
ideas produced was not clear. However, the 
percentage of hostile ideas produced by 
those preferring projection was signifi- 
cantly greater than for those not preferring 
projection (р < .05 men; р < .10 women). 
Blum (1954), Nelson (1955), and Erik- 
‚зеп and Browne (1956) have all provided 
evidence of a greater tendency to per- 
ceptual defensiveness on the part of those 
who prefer the "repressive" mechanisms of 
reversal and avoidance. Such findings 
might be related to the present theory in 
the following manner. It would be expected 
that if distances and differences between 
parts of a stimulus picture involving con- 
flict were habitually neglected, the patterns 
and contours of such a picture would not be 
readily identified. On the other hand, any 
tendency toward picking out and emphasiz- 
ing differences should make for easier iden- 
tification. The "repressor" would in this 
Case be more likely to show perceptual 
defense. 
Such individual tendencies to exaggerate 
or to underestimate distance recall the in- 
teresting observations of Klein (1951) and 
his colleagues which have shown certain 
Anschauungen to Characterize individual 
Styles of adjustment. In particular, the 
tendency of some (levellers) to emphasize 
sameness and of others ( Sharpeners) to see 
differences is close to the present formula- 
tion at the descriptive level, 


Generality of Styles of Dealing 
with Distance 


It is important to emphasize once again 
that neither our theory nor our data imply 
that the style of reacting to distance is 
necessarily general over al] Situations for 
all people. As already noted it has been 
shown that defense preferences themselves 
are not completely general (Blum, 1957). 
Moreover, while the present study has re- 
vealed correlations between Opposites meas- 


ures and between opposites and defense 
preferences when conflict pictures were 
used to identify preferences, the correla- 
tions were not all equally high nor were 
they all significantly positive. A more sys- 
tematic sampling of situations is required 
before the question about degree of gen- 
erality of these or any other characteristics 
can be answered satisfactorily (Brunswick, 
1947). Meantime, it appears that there 15 
indeed some tendency toward generality, 
for a large proportion of the correlations 
between measures of opposites were posi- 
tive and very few were negative. Further- 
more, those preferring projection did more 
often give evidence of exaggerating distance 
in comparison to nonprojectors and those 
preferring reversal more often underesti- 
mated distance. 


Nevertheless, a more qualified statement 
of the theory is required and one which 
recognizes some of the complexities of de- 
scribing psychological structure, It may be 
stated in this fashion: if projection 15 
chosen as a defense in some situation, then 
distances between the concepts and attitudes 
involved in the motivational System in ques- 
tion have a tendency to be exaggerated; 1 
reversal is preferred, distances tend to be 
reduced. However, the exaggeration 0 
intercategory distance by the projector may 
be accompanied by smaller mean differences 
within certain categories. In the present 
study the high projection-low reversa 
group showed significantly smaller mean 
differences between the attitudes toward the 
concepts constituting the ingroup category 
(FRIENDS, FAMILY, AMERICANS) than di 
the low Projection-high reversal grou? 
There was, however, no evidence of a РЕ" 


related difference between the outgrouP 
Concepts here used. 


Now, the observation that smaller i! 
group attitude differencegis and larg? 
m&roup-outeroup differences character? 


ا 
There i ; small‏ 15 
here 15 а question as to whether the due‏ . 

ingroup differences for 


attitudes which might impose a ceiling effect 
would tend to reduc 


pees paia) up 
1 е the variation in ing’? 
attitudes, 


d 
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projectors need not embarrass the notion 
that certain manipulations of distance are 
characteristic of defensive styles, for a 
small mean ingroup difference may have the 
effect of increasing distance between in- 
group and outgroup, the clarity of any such 
distinction between parts of a system rest- 
ing as it does on the magnitude of differ- 
ences between the components of any one 
category as well as on the distances between 
these components (ingroup) and the com- 
ponents of the other category (outgroup). 
Table 8 suggests that for projectors the 
mean intercategory distance’? and the mean 
intracategory distance (ingroup) are more 
different (difference in means = 36.0; s, = 
4.8) than was the case for the low projec- 
tion-high reversal group (difference in 
means = 19.3; sa = 4.3). 

In short, if for the reversal group dis- 
tances between the ingroup components are 
not distinguishable from the distances that 
Separate ingroup and outgroup components, 
boundaries will not be clear between in- 
Sroup and outgroup. Indeed, such an or- 
Sanization approximates one instead of two 
Categories, and suggests that there has been 
assimilation of ingroup and outgroup atti- 


16 The 1c-0G difference entered in Table 8 is the 
mean of the mean of the nine differences between 
three ingroup and three outgroup attitudes calcu- 
lated for each S and divided by three. This pro- 
cedure makes the intracategory and the inter- 
category differences comparable. 


tudes making the former less favorable and 
the latter more favorable. On the other 
hand, contrast may be involved in the 
greater ingroup-outgroup attitude differ- 
ences of projectors (Peak, 1958a). 


SUMMARY 


In a program concerned with psycholog- 
ical structure and attitude change it is 
necessary not only to find ways of discover- 
ing the existence of different structures but 
also to determine how they are reflected in 
attitudes and other psychological processes. 
Studies have been reported on the interrela- 
tion of measures of opposites structure, and 
the relation of such measures to preference 
for certain defense mechanisms, to the or- 
ganization of ingroup and outgroup atti- 
tudes, and to other variables. 


The psychological distance between 
events and the degree of their oppositeness 
was assumed to be a determinant of re- 
sponse in the following situations: (a) 
ratings on evaluative and potency scales of 
19 concepts consisting of 6 pairs of oppo- 
site terms and 7 other terms, (b) the pat- 
tern of sorting statements about Negroes 
into categories, and (c) giving free asso- 
ciations to a number of words. Defense 
preferences were determined by the Blum 
Defense Preference Inventory. Attitudes 
toward the ingroup (friends, family, Amer- 
icans), toward the outgroup (Negro room- 
mate, Negro neighbors, foreigners), and 


TABLE 8 


WITHIN AND BETWEEN-CATEGORY DIFFERENCES IN ATTITUDES 


High Projection— 


P of Difference Low Projection- 


Low Reversal Preference in Means High Reversal Preference 
Mean 5 Меап 5 
|IG-OG| 46.1 24.4 <.05 34.4 20.5 
|ME-OG| 33.4 24.1 <.10 24.8 20.1 
|IG-ME| 17.4 13.4 2.10 20.1 12.1 
ИС Dift| 10.1 8.9 «.05 14.9 11.2 
N 35 37 
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toward the self were inferred from ratings 
of concepts on Semantic Differential scales. 
Approximately 100 students served as Ss. 


1. Preference for projection and for re- 
action formation or reversal were nega- 
tively correlated as had been predicted. 

2. There was support for the prediction 
that high preference for projection and low 
preference for reversal would be related to 
а tendency to opposites structuring, and 
these relations were clearest when the 
defense preference was determined from 
the Blacky pictures which reflected conflict 
in an individual. 

3. As predicted, greater distances be- 
tween ingroup and outgroup attitudes were 
positively associated with a preference for 
projection, except when Ss were high in 
preference for both reversal and projection. 
Projection was also related to favorable in- 
group attitudes (significant in women), to 
unfavorable outgroup attitudes (significant 
in men), and to larger differences in ME- 
outgroup attitudes, 

4. As predicted, high preference for re- 
versal was positively related to favorable 
attitudes toward the outgroup in both men 
and women. There was also a significant 


negative relation of reversal scores with 


mgroup-outgroup distance, as well as with 
ME-outgroup distance, 


‚5. It was also shown that 
ring reversal tende 
ideas in writin: 


those prefer- 
d to produce fewer total 
riting the Blacky stories than did 
those not in that DPT Category. This is also 
interpreted as depending on a style of deal- 
ing with distance. 


6. It is proposed that these different cor- 
relates of projection and of reversal pref- 
erence may be understood by a theory 
which assumes that when projection 15 
adopted as a defense, distances between 
attitudes (and concepts) involved in а 
motive system are increased by contrast to 
the point where relations between the con- 
cepts (ingroup and outgroup) are dis 
rupted and motivation reduced. On the 
other hand, when reversal is the preferred 
defense, motivating activity is removed de- 
fensively by treating differences between 
these events as though they were negligible, 
thereby producing assimilation and reducing 
disparity in the system below the amount 
that must be present in order for activity te 
continue. 

7. Scores on the California F Scale bad 
the expected positive relations to ingroup- 
outgroup attitude difference, High F scores 
were associated with high preference for 
projection but with low preference for FÊ 
versal. This correlation with reversal СОЛ 
tradicts the usual assumption of а positive 
relation between these variables. 

8. As predicted, women tended to prefer 
reaction formation as a defense more бе 
than did men and men preferred project" 
more often than women did. Women А 
consistently higher average opposites SCC" 
than men on every measure, except oppo 
Sites associations. The different relatio!" 
between different variables tended to 
weaker for women than for men with m 
exception of relations to F scores. 
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: | 1 пе report presented in this monograph 
represents the accumulated evidence 


from the first four years of a continuing 
project of research in recognition. The em- 
phasis in the series of experiments reported 
herein has been upon the effects of varia- 
tions in the conditions of perceptual learn- 
ing on the responses of recognition to 
ambiguous cues. In one or more of these ex- 
periments the learning factors of frequency, 
selective attention, reward, and punishment 
have been investigated in terms of their 
relationships to perceptual responding. The 
ambiguous cues to which the perceptual 
responses were made consisted of parts of 
the stimuli seen in previous learning trials. 

There has been a good deal of interest in 
recent years in finding the relationship be- 
tween certain aspects of past experience and 
various phenomena which are clearly per- 
ceptual or which are closely related to per- 
ception. А large proportion of the work in 
this area has involved studies with primary 
emphasis upon the effects of variations in 
experiential frequency upon perception. In 
some of these studies the past experience 
has meant the individual's personal history 
in his pre-experimental environment, while 
in others past history has implied certain ex- 
perimentally controlled learning trials prior 
to some perceptual test. Thus, Howes and 
Solomon (1951) found that visual recogni- 
tion threshold, as measured tachistoscopi- 
cally in an ascending series, approximated 
a decreasing linear function of the loga- 
rithm of word frequency as tabulated by 
Thorndike and Lorge (1944). The obtained 


1 This research was supported by Research 
Grants M-1259 and M-2170 from the National 
Institute of Mental Health of the National Insti- 
tutes of Health, United States Public Health 


Service. 


correlations between mean duration thresh- 
old and word frequency varied between 
—.68 and —.82, in separate determinations. 
The counts of Thorndike and Lorge were 
assumed to be estimates of the relative fre- 
quencies with which their subjects (Ss) had 
encountered the various words. Attneave 
(1953) postulated that a high degree of 
familiarity with stimulus materials “may 
serve as a substitute for actual stimulation 
in the perceptual process” (p. 81). In his 
experiment Ss were required to judge 
the relative frequency of occurrence, in 
English text, of each letter of the alphabet. 
He found that a power function provided a 
reasonably adequate fit for the points 
plotted on the basis of median judged fre- 
quency and actual frequency of occurrence 
of the letters in a sample of 10,341 letters 
from printed material. The correlation be- 
tween log judged frequency and log actual 
frequency was .88. 

Solomon and Postman (1952) empha- 
sized that an individual S’s relative fre- 
quency of contact with units of English 
language is only crudely estimated by counts 
of printed material. Accordingly, in order 
to have better control of frequency, they 
experimentally manipulated the frequency 
with which 5$ used pronounceable nonsense 
words. Visual recognition threshold for 
words was reported as a negatively acceler- 
ated decay function of frequency of experi- 
mental exposure with zero frequency in- 
cluded in the analysis, and as approximately 
a decreasing linear function of the logarithm 
of frequency when the zero frequency was 
omitted. Arnoult (1956), more recently, 
exposed 10 random nonsense stimuli to Ss 
at varying frequencies and then asked these 
Ss to rate the same stimuli on a familiarity 
scale containing five points. He found rated 


2 


familiarity of nonsense shapes a monotonic, 
negatively accelerated function of the ex- 
perimentally controlled frequency of experi- 
ence. 

Besides the research on the effects of fre- 
quency on perceptual responses, there have 
been studies on the influence of such factors 
as deprivation, reward, punishment, set, 
meaning, avoidance training, and labeling 
upon perception. Thus, Sanford (1936) 

ad Ss give their interpretations of am- 
biguous pictures shortly before and shortly 
after a regular meal, and then counted the 
number of responses involving food and 
eating. He found many more food-related 
responses before as opposed to after the 
meal. Later, Sanford (1937) and Levine, 
Chein, and Murphy (1942) obtained re- 
sults providing further confirmation of San- 
ford’s hypothesis relating food deprivation 
and an increase of Perceptual responses in- 


volving food, Proshansky and Murphy 
(1942) had Ss estimate the lengths of inter- 
mediate lines, the y 


À veights of intermediate 
objects, and the ext 

ment, before and a 
Jects, and moveme: 
Were rewarded w 
Jects, and movem; 


Were punished, 
Consisted, res 


pectively, of giving 15 cents 
to and Temoving 15 cents from the Ss 
hese Ss showed a si i 


Bruner, Miller, and Zimmerman (1955) 
found that errors i 


of recognizing words, 
presented in conjuncti 
noises, decreased а: 
of possible alternati 
was greater for s] 
for longer lists 
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syllables by means of electric shock during 
a training period. Nonshocked syllables 
were also presented during this period. In 
later tachistoscopic trials they found the 
mean GSR {ог wrong responses higher 
when a shock syllable was shown than when 
a nonshock syllable was shown, indicating 
that Ss can make "autonomic discrimina- 
tions" even when they do not give accurate 
verbal reports. Finally, Hake and Eriksen 
(1956) found that the performance of А 
оп a recognition task was affected by a 
number of labels used in naming the T 
patterns used in the task, but not by the 
number of labels used in naming the same 
patterns previously. i 

The experiments to be reported here a 
in the general class of research aimed gs 
determining the relationships between v 
periential and perceptual variables. The 
experiential variables in these experiments 
are frequency, reward, and punishment ; 
While the perceptual variables involve Te- 
Sponses indicating recognition of ambiguous 
stimuli. An exp 


erimental approach to eval- 
uate and control the 


s n 
experience has bee 
used. 
Recognition has been assumed to be 2 
classif 


ying process where cues are used A 
a set of decisions as to the assignmen 
given stimulus object to a partic 
Чазз. Binder (1955) and Bruner (1957) 
have adopted this approach in their — 
retical presentations of the processes of F с^ 
erception, respectively. Я 
point, a naming respon 
icular stimulus object ? 


make 
of a 


Thus, when a person » 
confronted with the task of identifying 
€ its size to rule out od 
asses as “basketball” and hg $ 
DS its shape to rule 
One would predict, or 
analysis, that, when an p 
earned to make discrimina the 
Tesponses indicating recognition On о 
basis of Configurations of cues, he сап of 
identify objects on the basis of subset? 
the configurations, 


ball" and perpa 
“tangerine.” 
basis of this 
vidual has | 


$ 
: " есе 
But, if the complete set of cues is "T 
Sary for unique discrimination, any 1 


s 
lass defined on the basi 


| 


tion in information resulting from the avail- 
ability of only a subset of all cues produces 
ambiguity. If a person catches only an in- 
Stantaneous glimpse of a spherical object 
traveling through the air at high velocity, 
for example, we assume that his previous 
experience with full cue objects enables 
him to use the reduced cues for identifica- 
ton purposes. The cue “spherical,” how- 
ever, has been part of the complete set of 
cues of many objects, including tennis balls, 
golf balls, and oranges. The situation is 
ambiguous because of the limited informa- 
tion in the form of cues. 

Ambiguity may also occur when too 
many cues are present, as, for example, in 
the case of reversible figures where the cues 
defining two or more objects are present in 
the stimulating situation. Thus, considering 
stimulus-object classes as defined on the 
basis of common attributes or cues, a stim- 
ulus situation. may be called ambiguous 
When the entire set of cues available to an 
Observer belongs to the attribute arrays of 
two or more such classes. 

The tachistoscope produces ambiguity by 
presenting а stimulus object so rapidly that 
details are obliterated in the over-all blur 
Seen by an observer. The cues available to 
the observer might have come from any 
number of different objects. And the faster 
the stimulus is presented, the more the blur- 
ring effect and the greater the ambiguity as 
a result of the larger number of stimulus 

objects from which the available cues could 

have соте, 

In the case of the Rorschach and similar 
projective techniques, ambiguous stimuli are 
used to assess various aspects of the life 
histories of individuals. An unstructured 
form, which has shape, color, and/or tex- 
ture-like attributes common to many object 
classes, is presented to Ss under instructions 
to specify what it looks like. Some may 
call a given blur a human, others a bear, and 
perhaps still others a dead eagle. Assuming 
no pathology on the part of the 5, attributes 
of the blur may be characteristic of the 
objects in each of these classes. The attri- 
butes which would lead to clear differentia- 
tion among the classes are not available. 
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But what are the factors which lead to 
one response rather than another equally 
adequate one in an ambiguous situation? 
Which response, for example, would the 
person, who caught the quick glimpse of 
the object in the air, make if asked to iden- 
tify the object. If he were standing on a 
tennis court, he would almost certain. 


t k 1 is ly call 
it a “tennis ball,” unless, perhaps, there 
were an adjoining golf driving range. The 


response would therefore clearly be related 
to the frequency with which the various 
objects were previously experienced in the 
given context. Using the Rorschach and 
similar projective techniques, the clinician 
uses the responses of S to evaluate the lat- 
ter’s personality. The clinician is, explicitly 
or implicitly, assuming that the choice 
among the possible recognition responses 
of an individual to an ambiguous blot con- 
figuration is a function of the individual's 
previous experiences of reward, punish- 
ment, deprivation, frustration, and so forth. 
One person may have been frustrated in his 
experiences with women and give no re- 
sponses dealing with femininity throughout 
his entire Rorschach protocol Another 
may have been deprived sexually and report 
sexual symbols on every blot. 
Much of the research reported above on 
the relationship between experience and 
perception has dealt specifically with the 
factors which influence responses to ambig- 
uous cues (Levin et al., 1942; Proshansky 
& Murphy, 1942; Sanford, 1936, 1937; 
Solomon & Postman, 1952). But there has 
been no systematic attack on the problem, 
and wide gaps remain in our knowledge. 
The purposes of the present experimental 
program were to investigate certain rela- 
tionships not considered in the earlier work 
and to explore further some of the factors 
previously evaluated. Particular emphasis 
was laid upon the analysis of the effects of 
frequency of occurrence upon responses to 
ambiguous cues. 


EXPERIMENT I 


There were two principal goals to this 
experiment. The first was to investigate the 
functional relationship between the fre- 
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quencies with which stimulus figures are ex- 
perimentally experienced and the frequen- 
cies of responses to cues which are parts 
of two or more of these figures. At one 
point in the presentation of his model of 
recognition, Binder (1955) assumed that 
an "individual uses a frequency of designa- 
tion of a particular class name equal to 
Pin where P; is the objectively measured 
probability of occurrence of the given class 
and л is the number of repetitions of the 
set of conditions" (p. 123). If we consider 
the probability of occurrence of an object 
equal to the relative frequency of its previ- 
ous occurrence in the given context, this 
leads to the prediction that the relative fre- 
quency of a particular response to ambig- 
uous cues is equal to the relative frequency 
of occurrence of that response and its asso- 
Ciated stimulus. The second goal was to 


test the hypothesis that 55 who learn to 


discriminate among objects on the basis of 
configurations of 


cues learn to make accu- 
rate classifying геѕропѕеѕ on the basis of 
parts of these configurations. As stated 
above, this is an implicit assumption in the 


models of Binder (1955 
(1957). p ) and of Bruner 


Method 


Ss from their normal viewing position. Panel, 
table, and box were all! painted a dull black. 

'The apparatus was situated in a semidarkened 
room, and Ss could see the stimulus and response 
cards only when the bulbs behind the mirrors were 
on. At other times, of course, they could only Bun 
the reflecting surfaces of the mirrors. The mirror: 
were designed to provide 33% transmission ee 
light, 3396 reflection, and absorption of the remain 
ing light. 

Presentation materials. Eight nonsense syllables 
of 3396 association value were used as mens 
during the learning trials of the experiment (бї 
1928). The names are as follows: TUZ, RI "i 
WOY, JOM, KAL, NAF, VED, and GEP. е. 
stimulus figures were also used and may be Se. 
in Fig. l. The squares within which the differen 
ating cues are placed are 4 in. in length on ea! 
side. These stimulus figures were designed in Ren 
а way that every cue within the common squa! 
was necessary for some discrimination. 


LEARNING 
FIGURES 


TEST 
FIGURES 


TEST 
FIGURES 


LEARNING 
FIGURES: 


Fic. 1. Lea 
Exps. I, П, and III. 


| ыу. ЖШ 
Thing figures and test stimuli f° 


The three test Stimuli ma: n in Fig. 1. 
Test Stimulus A contains D wp common 
to Figures А, and A:, Stimulus B the set of od 
common to B, and В, and Stimulus C the cue СО 
a, and C. 


pe Саге was taken during |. 


е cues appearing o gn" 
as to provide for A on 
ace of common cue 


d 
of pai MI Ss were trained by the m лей} 
Paired associates (with correction) during i 
earning phase, with stimulus figures appearing 
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the left mirror and nonsense syllables in the right 
mirror. The following intervals were used: stim- 
ulus exposure 51⁄4 secs., response exposure 2 secs., 
intertrial interval 734 secs., and delay after warn- 
ing buzzer 11 secs. The delay between blocks 
averaged about 30 secs., but was never less than 22 
secs, 

The procedures used for the Ss in Group I-E 
and those in Group I-U differed only in the com- 
position of each learning trial block. For the Ss 
in Group I-E, a trial block consisted of a single 
presentation of each of the eight stimulus figures 
together with its associated nonsense name. The 
trial blocks for the Ss in Group I-U contained 
figure-name pairs presented varying numbers of 
times, In the trial blocks of Group I-U, either 
А, ог As occurred twice and the other once, В, ог 
В: occurred four times and the other once, and one 
of C, C, Ca or Си occurred three times and the 
other three once each, While A, and Аз always 
appeared in the ratio of two to one per trial block, 
the particular one which occurred twice and the 
one which occurred once were randomly deter- 
mined for different Ss, In a similar fashion, 
whether Bi ог B: appeared four times per block 
and whether Ci, Cs, Co, or С: appeared three times 
Der block were randomly determined. _ Table 1 
Shows a summary comparison of the trial blocks 
for the two groups. This table can be considered 
only representative, of course, since the particular 
one of a subset (like A: or Аг) which occurred 
more frequently varied randomly over Ss. The 
Group I-U trial blocks in all cases consisted of 14 
pairings. 

The nonsense syllables paired with. particular 
figures were randomly determined for different Y 
The Ss were instructed that the associated non- 
sense syllables were to be considered the names 
of the figures and that they were to anticipate 
each figure’s name after the figure was exposed 
but before its nonsense name appeared. To make 
this more like a recognition situation, the Ss pro- 


TABLE 1 


NUMBER ог OCCURRENCES OF LEARNING FIGURES 
PER TRIAL BLOCK FOR ONE POSSIBLE ASSIGNMENT 
IN Exp. I 


Learning Figure Group 1-Е Group 1-0 


А; 1 2 
Аз 1 1 
Bi 1 4 
Be 1 1 
С 1 3 
C. 1 1 
Cs 1 1 
€, 1 1 


Total Presentations 
per Block 


со 
m 
> 


nounced rather than spelled ош the nonsense sylla- 
bles. During the first trial block, E pronounced 
each syllable as it appeared, in order to minimize 
any reluctance on the part of Ss to pronounce any 
syllable. 

The ordering for the presentation of stimuli for 
each odd-numbered trial block was determined by 
a randomization schedule which E had prepared 
in advance. A restriction imposed on the random- 
ized order was that a single stimulus could not ap- 
pear more than twice in succession. The reverse 
order of presentation to that of the immediately 
preceding odd-numbered block was used for each 
even-numbered block. The sequence of randomized 
order as read from the schedule, reversal of that 
order, new randomized order, etc. was repeated 
with only one break until a criterion of two con- 
secutive blocks of perfect anticipations was 
reached. This break consisted of a 5-min. rest 
period at the end of 40 min. (if the criterion had 
not been reached by this time). At the criterion 
point of two perfect blocks, the Ss were given 
another 5-min, rest period. After this rest period, 
the presentation of learning blocks of paired asso- 
ciates was repeated until the Ss achieved one more 
block of perfect anticipations. They were then 
given their first test trial, and this was followed by 
the learning trials necessary for another block of 
perfect anticipations. Then came the second test 
trial, learning trials to one further block of perfect 
anticipations, and finally the last test trial. The 
purpose of the learning blocks intervening between 
all test trials was to assure as far as possible that 
the Ss maintained the learning level achieved previ- 
ously. 

The three test trials which all Ss were given 
consisted of single presentations of each of the 
Stimuli A, B, and C (see Fig. 1), The order of 
presentation of these three stimuli was randomized 
with the one restriction that Stimulus C was not 
to appear on the first trial. The test stimuli were 
shown for a duration of 124 secs, beginning 1% 
secs. after the sounding of the warning buzzer. 
The intertrial interval was 9 secs. For the test 
phase of the experiment only the left mirror of 
the exposure apparatus was used. 

Immediately following the first criterion of two 
consecutive perfectly anticipated trial blocks, the 
Ss were told that they were ready for the second 
phase of the experiment in which a new figure 
would be presented after each further block of all 
correct responses. They were told that the new 
figure would not be exactly like any of those shown 
before and would be exposed for a shorter period 
of time, but that they were to respond with the 
name of the most closely related figure. The addi- 
tional instructions during the actual test phase, 
after the 5-min. break, were as follows: 

In order to make sure that you still remember 
the characteristics together with the correct 
name of each of the figures presented during the 
first part of the experiment, I will now present 
them in the same way once again. As before, 
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I would like you to name the figure shown in the 
left window just as soon as you can aíter it is 
exposed, before the name is shown a few sec- 
onds later in the right window. We will not go 
on to the new figure, which I told you about 
before, until you complete a run giving the cor- 
rect name of all of the figures before their 
names are shown in the right window. Correct 
any misses or mistakes as you did previously. 

[After criterion of one perfect block]: 

Now here is a new figure. Give the one name 
out of the set you have learned which seems 
most appropriate, even though this figure is not 
exactly like any of those just shown. No name 
wil appear in the right window. (The figure 
became visible to the Ss about 9 secs. after the 
completion of these instructions.) 

[After exposure of first test stimulus] : 

Once again we will rerun the figures and 
names as in the first part of the experiment to 
make sure you remember them. Name the figure 
before its name appears in the right window 
and correct misses and errors. 

After the criterion of one further errorless block 
was achieved, the same instructions were read 
as after the perfect block above ("Now here is a 
new figure," etc.). These were repeated once again 
before the final test trial. The instructions which 
followed the first test stimulus (“Once again we 


will rerun," etc.) were reread prior to the final set 
of trials of paired associates. 


Results and Discussion 


Let us consider an example in terms of 
the Tesponses to Test Stimulus A (see Fig. 
1) in order to illustrate how the results 
may be analyzed. If an S’s response to A 
is the name of either Figure A, or Figure 


2. 


2, it is called an appropriate response. If 
he responds with one of the names of any 
of the other six figures, it is called an inap- 
propriate response. A third category of re- 
sponse could occur if an S did not follow 
instructions and responded to a test stim- 
ulus with a name which had not occurred 
during learning. This happened in very few 
cases, and these responses were placed in 
the inappropriate category. Thus, we can 
consider the results in terms of the number 
of appropriate responses to A as opposed 
to the number of inappropriate responses 
and in terms of the relative number of A, 
and A, naming responses to А. Similar 
analyses may be made of the responses to 
Test Stimuli В and С. 


The numbers of appropriate and inap- 
propriate responses as well as the numbers 
of responses of differing learning frequency 
within the appropriate category may be seen 
in Tables 2 and 3. The frequencies expected 
from the hypothesis that relative frequency 
of response equals relative frequency ot 
learning occurrence are shown parentheti- 
cally in the columns designated "Approprr 
ate Responses.” The correspondence be- 
tween the values expected on the basis of 
this hypothesis and those actually obtaine 
is obviously close. However, it will be most 
convenient to delay a fuller discussion O 
these particular results until the data are 
analyzed in terms of the appropriate-inap- 
propriate dichotomy. 

According to the second hypothesis stated 
in the introduction to Exp. I, Ss who ha 
learned the figure-nonsense name assocla~ 
tions perfectly might have been expected 
to have made no inappropriate test T€ 
sponses. But, despite the fact that a str” 


TABLE 2 


OBTAINED AND EXPECTED (IN PARENTHESES) 
RESPONSE FREQUENCIES TO Test STIMULI 
For Ss iN Group I-E 


== 


Test Stimulus A 
= Inappropriate 


Appropriate Responses Responses 


A А, 5 
15 15 
(15.0) (15.0) 


Test Stimulus В 


А. 


Inapproptiat® 


Appropriate Responses Response? 
اض‎ à Mg l.S 
Bi Ba 7 
15 13 
(14.0 14. 
) (40) P 


Test Stimulus C 


aË 
Тпарргор@ 


Appropriate Responses Response? 
Cı С, Ci С, ? 
8 а 4 1 11 
(8.25) (8.25) (825) (825) 


| 
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TABLE 3 


OBTAINED AND EXPECTED (IN PARENTHESES) 
RESPONSE FREQUENCIES TO TEST STIMULI 
кок Ss IN Group 1-0 


Test Stimulus A 


Appropriate Responses Inappropriate Responses 


AQ) A(1) 8 
20 3 
(15.3) (7.7) 


Test Stimulus B 

АЕ Е неа 
Appropriate Responses Inappropriate Responses 
B(4) 
20 
(20.8) (5.2) 


B(1) 5 


Test Stimulus C 


vu ete 
Appropriate Responses Inappropriate Responses 


C(3) All C(1) Pooled 3 
13 15 
(14.0) (14.0) 


gent criterion was used, Ss made errors 1n 
terms of misclassification up to the final one 
of the required five perfect blocks. А factor 
based on incorrect anticipations during the 
learning trials following the achievement of 
the first three required perfect trial blocks 
was accordingly used to estimate the imper- 
fection in learning at criterion. 

In order to make a correct anticipation 
during learning, a given 5 must use all the 
information in the form of cues: circle ог 
triangle, wavy or straight line at the top, 
and line at right or left corner (see Fig. 1). 
He must, in other words, classify correctly 
according to three dichotomies. An incor- 
rect response may result from an error in 
one, two, or three of those dichotomous 
categorizations. If the figure exposed hap- 
pens to be В», for example, a response cor" 
sisting of the name of В, or А» would re- 
sult from one misclassification while the 
fame of C, to Bs would represent three 
misclassifications. - 

The computations used in arriving at the 
expected number of appropriate and inap- 
Dropriate test responses on the basis of the 


errors in anticipation made by Ss at cri- 
terion are outlined in Table 4. For all Ss, 
total number of misclassifications on all 
trials, following the first three required 
perfect trial blocks, were counted. Also 
counted were the total number of blocks, 
over all Ss in the respective groups, after 
the third perfect one and up to and includ- 
ing the fifth perfect one. Since each block 
for Group I-E contained eight trial pairings 
and three classifications were necessary per 
trial (one for each cue), the total number 
of necessary categorizations for the Ss in 
Group I-E after the third perfect block is 
the total number of trial blocks multiplied 
by 24 (8 X 3). The multiplier in the case 
of the Ss in Group I-U is 42 (14 X 3) since 
there were 14 trials per block for Group 
LU. The number of misclassifications per 
attempted categorization at criterion for 
each group was obtained by dividing total 
misclassifications by the number of neces- 
sary categorizations. 

The number of inappropriate responses 
expected purely on the basis that the Ss had 
not learned to classify without error using 
each of the three cues could now be com- 
puted. When presented with Test Stimuli 
A or B (see Fig. 1) each S had to cate- 
gorize on the basis of two cues, while when 
presented with C he had to make only one 
dichotomous categorization (one of the C; 
fi = 1, 2, 3, 4] stimulus figures or not a 
C; stimulus figure). The total number of 
categorizations necessary Бу the Ss in Group 
LE to Stimulus A was 70 (35 X 2); to B, 
70; and to C, 35. The comparable numbers 
for the Ss in Group I-U were 62, 62, and 
31. The total number of necessary test 


TABLE 4 


PERFORMANCE IN LEARNING TRIALS FOLLOWING 
First THREE REQUIRED PERFECT BLOCKS IN Exp. I 


(Summed Over All Ss) 


Group I-E Group I-U 


Total Misclassifications 91 83 
Trial Blocks 128 113 
Necessary Categorizations 3072 4746 
Misclassifications рег At- 

tempted Categorization — .0296 .0175 
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TABLE 5 


FREQUENCIES OF APPROPRIATE AND INAPPROPRIATE 
RESPONSES TO TEST STIMULI FoR Вотн GROUPS 
or Ss IN Exe. I 


Obtained Expected 
Responses Responses 
Test 
Ѕіти- | Appro- Inappro- | Appro- Inappro- 
Group lus | priate priate | priate priate 
I-E A 30 5 93 2 
I-E B 28 7 33 2 
1-Е € 33 2 34 1 
1—0 А 23 8 30 1 
1-0 В 26 5 30 1 
1-0 € 28 3 30 1 


categorizations multiplied by the number 
of misclassifications per categorization at 
criterion gives the total expected inappro- 
priate responses to the test stimuli. 

The expected frequencies of appropriate 
and inappropriate responses and also the 
obtained frequencies may be seen in Table 
5. Fisher's exact test was applied to all 
differences between expected (on the basis 
of actual learning responses) and obtained 
frequencies, and only the difference due to 
the responses of the Ss in Group I-U to 
Test Stimulus A is significant. The sig- 
nificance level is Р = 05. It would seem 
reasonable to conclude that these results 
Support the hypothesis that Ss can identify 
components of the stimuli to which they 
previously learned names, 

Perhaps a final remark should be made 
on the justification for choosing the learn- 
ing trial blocks following the third required 
perfect block for predicting frequencies of 
inappropriate test responses. Each 5 had to 
achieve five blocks with perfect anticipa- 


* This requires the assumption that each $ who 
gives an inappropriate response makes only one 
misclassification in doing so. But this would seem 
entirely reasonable since the average number of 
misclassifications for each erroneous anticipation 
during learning was 1.00 for the Ss in Group I 
and 1.02 for those in Group П. And to make the 
argument even stronger, the maximum number of 
misclassifications possible was three during the 


learning trials and only two during the testing 
trials, у 


tions : two consecutive ones during the learn- 
ing phase, one more after a 5-min. rest pe 
riod, and one additional perfect block after 
each of the succeeding two test trials. It 
was necessary to be sure that Ss were at 
criterion level of performance for these 
computations of misclassifications and the 
trial blocks prior to the first two perfect 
ones could obviously not be used. Nor 
could the trials between the second am 
third perfect blocks be used since а rest 
period, and presumably some forgetting, 
had intervened between these blocks. After 
the third perfect block, however, only ; 
114-sec. test trial and a very brief set © 
instructions came between successive learn- 
ing trial blocks. This made it reasonable 
to assume that the Ss were not below their 
first criterion level during this period. | 
For the sake of facilitating. future dis- 
cussion (with specific reference to the SS 
in Group I-U) it is convenient to introduce 
notation to allow us to refer to the stimulus 
figure occurring at a given frequency p 
block, regardless of the specific figure 35 
may be. Accordingly, let А (1) be the Ai 
(i = 1 or 2) figure which occurs once Ре" 
block for a certain $ (as randomly derer” 
mined) and A(2) the A; figure occurring 
twice per block for the S. Thus if ^" 
occurs twice per block and А, once 45 a 
result of the random determination, AC() 
refers to Аг and A(2) to A, in this pat 
ticular case. In a similar fashion, let BC ) 
be the B; (i = 1 or 2) occurring once Ре! 
block, B(4) the В; occurring four time 
per block, C(1) any of the three C; (7 7. 
1 or 2 or 3 or 4) figures occurring once P^ 
block, and C(3) the C; figure occurring 
three times per block for a given S. " 


Let us return to the comparative e 
quencies which were obtained within 02 
category of appropriate response. Tab P d 
contains the frequencies of the respon? ү 
of the Ss in Group I-E to the three e 
stimuli. The numbers in parentheses Ж he 
obtained for a given test stimulus bY 
formula 


Oi 
o, (№) 


where О; represents the number of Û 


mo 


ue m 
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the i? figure occurred on each learning trial 
block, О, represents the number of times 
the learning figures containing all cues of 
the given test stimulus occurred per trial 
block, and N represents the total number of 
appropriate responses given to the test stim- 
ulus by all Ss. As an example of the use 
of this formula, consider the responses 10 
Test Stimulus А (Table 2). In this case 
the total number of appropriate responses, 
or N, is 30. If the learning figure of inter- 
est (that is, the i^ figure) were As Оз = 1. 
Since there are two learning figures con- 
taining all cues of Test Stimulus A. (that is, 
A, and А.) and they occurred a total of 
twice per trial block, Os — 2. Therefore, 
we have 

Оз (у) = (30) = 15 
О, 
We thus predict а total of 15 responses to 
Test Stimulus A consisting of the name 
Previously associated with As. The paren- 
thetical numbers in Table 2 are in all cases, 
then, the frequencies of various appropri- 
àte responses which would be expected if 
the Ss were giving particular appropriate 
responses at the same relative frequencies 
às the occurrence of their associated stimuli 
per learning trial block. 

Table 3 contains the comparable obtained 
and expected frequencies for Ss in Group 
I-U. The numbers in parentheses for a 
given test stimulus are not equal (as in 
Table 2) since the learning stimuli were 
presented at differing frequencies per trial 
block. 

The relative frequencies expected on the 
basis of learning occurrences were consid- 
ered theoretical (population) values in test- 
ing for the significance of the difference 
between obtained and expected relative fre- 
quencies. The binomial test was used for 
the five cases in which the appropriate re- 
sponses fell in two classes, and the x? test 
for the one case involving four appropriate 
classes of response (that is, the responses to 
Stimulus С for the Ss in Group I-E). Only 
one of these six comparisons showed a sig- 
nificant difference (Р =.05). This ават 


occurred in the responses to Stimulus A for 
the Ss in Group I-U. 


Since much of the above is consistent 
with recency, it was desirable to check upon 
whether or not the particular appropriate 
responses given tended to be the last en- 
countered in learning. Accordingly, the 
learning block preceding each test trial for 
the Ss in Group I-E was examined to see 
if the test response was or was not the most 
recent of the possible appropriate responses. 
It should be emphasized that, for this pur- 
pose, only appropriate test responses to a 
given test stimulus were considered and 
the sequence of figure-name pairs during 
the immediately preceding learning block 
was evaluated only from the standpoint of 
the ordering of appropriate responses. 
Thus the two appropriate responses to A 
are the names of A, and A, and, when 
either of these names was given, the prior 
learning block was examined to see if the 
stimulus-response pair corresponding to 
that name did or did not occur after the 
other possible pair which could have been 
named. The test responses to B and C were 
similarly analyzed. Since there are only 
two appropriate responses to Stimuli A and 
B, chance occurrence with no recency effect 
would be indicated by an equal division of 
the appropriate responses into "last" and 
“not last" categories. In the case of Stim- 
ulus C there are four appropriate responses, 
so one in four can be expected to be a last 
response by chance. Table 6 contains the 
number of appropriate responses to each of 
the three test stimuli which were and which 
were not the names of learning figures oc- 


TABLE 6 


NUMBER OF APPROPRIATE RESPONSES FOR 
Group I-E Watch WERE AND WERE Nor Last 
LEARNING RESPONSE (AS COMPARED WITH OTHER 
POSSIBLE APPROPRIATE RESPONSES) 


Last Not Last 


Expected Expected 
Test by by 
Stimulus | Obtained Chance |Obtained Chance 


A 12 15 18 15 
B 14 14 14 14 
c 10 8.25 23 24.15 
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curring last (ignoring inappropriate stim- 
ulus names) in the immediately preceding 
learning block. Table 6 also gives chance 
expectations for “last” and “not last" re- 
sponses. Adding up all "last" and "not 
last" responses gives sums of 36 and 55, 
respectively. When опе considers that 
purely random responding with regard to 
“lastness” gives expectations of 37.25 “last” 
occurrences and 53.75 “not last” occur- 
rences, it is clear that the data present no 
evidence for a recency effect large enough 
to effect the above interpretations. How- 
ever, it should be pointed out that this test 
is surely not sensitive enough to detect the 
small sequence effects predicted and found 
by such investigators as Estes and Straug- 
han (1954). 

As a final recency comparison, it was 
found that 16 of the total of 105 responses 
(appropriate and inappropriate) given by 
all Ss to all test stimuli were the names of 
the very last stimulus (considering all eight 
learning figures in this case) to occur in 
the previous learning blocks. This compares 
with an expectation of 13.1 if the Ss were 
giving the most recent of all learning re- 
sponses purely on a chance basis. 


Thus, a very striking relationship be- 
tween the perceptual variable, frequency of 
response to an ambiguous testing stimulus, 
and the learning variable, frequency of oc- 
currence of figures in blocks of paired 
associates, was found. The evidence sup- 
Ports the preliminary prediction that the 
frequencies of responses consisting of the 
various possible appropriate figure names 
parallel the frequencies with which the Ss 
had previously responded to the figures. 

In only one of the comparisons did this 
relationship fail to hold: that was in the 
case of the responses of the Ss in Group 
LU to Test Stimulus A. There were other 
idiosyncrasies in this particular case. First 
as pointed out above, this was the only 
group that gave significantly more inap- 
propriate responses than expected on the 
basis of misclassifications during learning 
trials at criterion. And second, if one looks 
at the particular kinds of inappropriate re- 
sponses made, one finds that six of the eight 


errors consisted of the name of Figure C, 


to Test Stimulus A (see Fig. 1). Notice 
that C, differs from the appropriate re- 
sponse to A, Ау, only in that the former 
has a wavy line and the latter a straight line 
at the top. In all other test trials the inap- 
propriate responses were much more evenly 
scattered over the possible choices. The five 
inappropriate responses of the Ss in Group 
I-E to Test Stimulus A, for example, con- 
sisted of one C, name, one B, name, one 
В. name, and two С. names. 

While it is hardly startling, support was 
also found for the hypothesis that Ss who 
have learned differentiating responses to 
configurations of cues are able to make 
accurate discriminatory classifications on the 
basis of cue components. As mentioned 
above, the only discrepancy in six compari- 
sons was the generally pathological case 
of the responses to Stimulus A by the Ss 
in Group I-U. 


EXPERIMENT II 


The primary purpose of this experiment 
was to test further the hypothesis that the 
relative frequency of an appropriate rec- 
ognition response to ambiguous cues is equa 
10 the relative frequency of that response 
and its associated stimulus during prev!” 
ously reinforced learning trials. A second- 
ary purpose was to explore the generality 
of this relationship by altering the condi- 
tions in the one previous experimental sitt- 
ation in which it did not hold. This altera 
tion was accomplished by dividing the eig” 
stimulus figures of Fig. 1 into two sets ай 
then administering them to separate group? 
of Ss. The division of figures was such that 
the appropriate response figures for the 16 
stimulus giving the discrepancy in the prev 
ous experiment (that is, A) were in one 
part and the figure associated with the 2 
appropriate response which seemed to Бе E. 
source of difficulty (that is, C4) was iP } 
other. This division had the further V° 


€ 
marked advantage of reducing the averae n 


time necessary to complete the experim* 
from the approximately 85 min. require 
of the 55 in Exp. I to about 25 min. Thi 
facilitated the accumulation of much 12759 
sample sizes, which seemed very desira 


ES E E 
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in attempting to verify the relationships 
found previously. 


Method 


Subjects. The Ss were volunteers from the 
introductory psychology course at Indiana Univer- 
sity. All Ss were run individually but were con- 
sidered to be in one of two groups in accord with 
the learning procedure used. Group П-АВ con- 
tained 55 Ss; and Group II-C, 54 Ss. In contrast 
to the losses of Exp. I, very few Ss were lost 
because of administrative difficulties in this experi- 
ment, 

Apparatus and materials. The same apparatus, 
stimulus materials, and response words as in 
Exp. I were used. 

Procedure, All Ss were trained by the method 
Of paired associates in the manner described for 
Exp. I, with only those changes necessitated by 
the different stimulus groupings. The stimuli used 
for the Ss in Group П-АВ were As, As Bı and 
В, during the learning trials and A and B during 
the test trials. The name associated with a given 
figure was randomly determined for each S from 
among the eight possibilities. 

Each trial block for Ss in Group II-AB was 
eight stimulus-response presentations in length; 
two of the four figures occurred more than once 
Der block. The specific figures replicated varied 
randomly over Ss. For some Ss, Figure A; and 
Its associated nonsense syllable occurred twice per 
block and Figure A: once, while for others A, 
Occurred once and А: twice. Similarly, Figure В: 
Occurred four times per block and B: once for 
Some Ss, while B; occurred once and B: four times 
for others. Once chosen for a given S, the as- 
signed frequencies for the figures were maintained 
through all of his learning blocks. The following 
is an example of stimulus composition and order- 
ing for a Group П-АВ block: В, As, Bs, Bs, Аз, 
В, А», В. 

The stimuli used for the Ss in Group II-C were 
С, C; Cs, and С, during the learning trials and С 
during the test trial. Again four of the syllables 
were randomly matched with the learning figures 
for different Ss. The learning trial blocks for the 
Ss in Group II-C were only six figure-name pair- 
ings in length. One among Cı, Cs, Cs, and С, was 
exposed three times per trial block and the other 
three once each, but the more frequently occurring 
one was chosen randomly for different Ss. Once 
chosen, this remained the figure which occurred 
three times through all the trial blocks for an S. 
^. possible figure composition and ordering for a 
block of Group С is Cs, Cj, Cs, Ca, Co, Cs. 

Prior to the learning trials, the Ss in both groups 
Were told that the nonsense syllables were to be 
Considered the names of the figures and that their 
task in subsequent trials was to anticipate this 
name after the figure was exposed but before the 
name appeared, The order of presentation within 

lock during learning was random on odd-num- 


bered trial blocks (with the restriction of no more 
than two consecutive appearances) and the reverse 
of the previous odd-numbered block on even blocks. 
Learning proceeded to a criterion of two consecu- 
tive trial blocks with perfect anticipation. After 
reaching this criterion, all Ss in both groups were 
given a 5-min. rest period, at the end of which the 
paired-associates blocks were repeated until they 
achieved one further block of perfect anticipations, 

'The Group II-AB Ss were then given their first 
test trial, and this was followed by a further set of 
learning blocks to a criterion of one more perfect 
block. Then came the final test trial. During the 
two test trials of the Ss in Group II-AB, Test 
Stimulus A was presented once and Test Stimulus 
B once in random order over Ss (see Fig. tx. 


Following the rest period and succeeding trials 
to one perfect block, the Ss in Group C were given 
a single test trial consisting of Test Stimulus С. 
This terminated the experiment for them. 


Prior to each test trial all Ss were told that a 
new figure, somewhat different from all those used 
during learning, would be presented and that they 
were to respond with the one previously learned 
name which seemed most appropriate. They were 
informed of the criterion requirements in terms 
of necessary perfect trial blocks preceding all 
learning stages. 


Results and Discussion 


Table 7 contains the number of Ss in 
Group II-AB who gave appropriate and 
inappropriate responses to Test Stimuli A 
and B. It also contains the number of S's 
who gave the various appropriate responses 
to Stimulus C in Group II-C. To illustrate 
the table’s meaning, when presented with 
Test Stimulus A, 36 of the 55 Ss in Group 
П-АВ gave an appropriate response con- 
sisting of the name of the learning figure 
which occurred twice as frequently as the 
other in their learning blocks, that is, A(2); 
16 Ss made A(1) and three Ss made in- 
appropriate responses (B’s or C's). The 
numbers in parentheses in Table 7 are the 
response frequencies we would expect if 
the Ss responded with appropriate responses 
at the same relative frequencies as their 
occurrence in the learning trial blocks ; they 
were obtained as described in Exp. I. 

The very few inappropriate responses 
obtained makes it reasonable to account for 
the phenomenon of inappropriate respond- 
ing solely on the basis of imperfect learn- 


ing of name pairs. This confirms the previ- 
ous finding. 
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TABLE 7 


OBTAINED AND EXPECTED (IN PARENTHESES) 
RESPONSE FREQUENCIES TO TEST STIMULI 
FOR ALL Ss IN Exe. П 


Test Stimulus А (Group II-AB 5s) 


Appropriate Responses Inappropriate Responses 


А(2) А(1) В, or В» 
36 16 3 
(34.7) (17.3) 


Test Stimulus B (Group II-AB Ss) 


Appropriate Responses Inappropriate Responses 


B(4) B(1) А, or А» 
42 11 2 
(42.4) (10.6) 


Test Stimulus C (Group II-C Ss) 
C(3) 

37 
(27.0) 


All C(1) Pooled 
17 
(27.0) 


The fit between obtained and expected 
frequencies for the responses of the Ss in 
Group II-AB to Test Stimulus A, on the 
one hand, and Stimulus B, on the other, 
is remarkably close. Statistically, the small 
differences between obtained and expected 
values are very far indeed from approach- 
ing significance. These data surely lend 
support to the hypothesis that the relative 
frequencies of various recognition responses 
to ambiguous cues equal the relative fre- 
quencies with which these responses oc- 
curred previously in association with their 
stimuli. 

However, using a x? test, the difference 
between the obtained and expected responses 
to Test Stimulus C by the Ss in Group 
II-C is significant at the .05 level. In look- 
ing at these data in Table 7, it should be 
remembered that the expectation of equal 
frequencies in the two appropriate response 
categories stems from the fact that, although 
one learning figure among the C; occurred 
three times as frequently per learning block 
as each of the other three—that is, C(3)— 

all responses consisting of the names of the 
three less frequent learning figures—all 


C(1)—have been pooled. In effect this 
means that one expects as many responses 
consisting of the more frequent name as 
all three less frequent names combined. | | 
These results are of particular signifi 
cance when compared with the findings A 
Exp. I. In Exp. II, the Ss in Group П-А | 
who were tested on A after learning trials 
involving the same ratio of presentation o " 
A, and A, as the 5s in Group I-U Ыр 
2 to 1), never saw the C; learning joe 
Consequently, the source of difficulty зе 
volving a large number of inapproP The 
C, responses to A was eliminated. de 
present data, therefore, support the hypo ie 
esis that the large number of inapprop' 
responses (of a particular kind) contribui A | 
toward producing the single significat. 1 
crepancy between relative frequencies u 
ing learning and relative frequencies among 
appropriate test responses in Exp. I. LC 
The present data from Ss in Group Irv 
are at variance with the previous finding =. 
no significant difference between expect“ 
and observed frequencies for the same test 
stimulus and the same learning present? | 
tion frequencies. In fact, the obtained == 
expected values for Group I-U are e 
identical. How can we account for pu 
marked difference in results? Assum 
that the difference was not due simply k 
chance sampling fluctuations, we must pe^ 
at the essential difference between the = 
experiments and consider its possible effec 
The Ss of Group I-U were trained On * Ё 
eight learning figures and tested on all thre 
test stimuli, while the Ss in group ^. g 
were trained on only the four C; learn 
figures and tested on С. The entire set 8 
earning and test stimuli of the prese, 
experiment was only part of a larger 
in the earlier experiment. In other ways 
relative frequency of presentation ames 
learning figures, learning criterion regt es 
ments prior to testing, nonsense sylla S 
potentially paired with learning БЕН 
over-all method of learning and test Pcia 
entation, and instructions—the two eX? 
ments were identical. 3.38 
In the case of the Ss in Group II-AB fy, 
well as in the case of the Ss in GrouP | шщ 
the trial blocks for all Ss contained at © 
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two stimuli which appeared more than once. 
This was not the case with the Ss in Group 
II-C who had only one stimulus presented 
more than once per block. When in repeated 
blocks of four different stimuli, one stim- 
ulus appears three times and the others 
appear only once each in successive blocks 
of six trials, it seems reasonable to assume 
that the more frequent stimulus becomes 
relatively conspicuous. Thus, 49% of the 
Ss made no errors in identifying the C(3) 
stimulus throughout their learning trial 
blocks (excluding the first block, of course, 
during which E read aloud the names). 
Moreover, 59% made no errors in identi- 
fying C(3) during either the second, third, 
or fourth blocks; and 6696 made no errors 
in identifying it during the second block 
alone. Fig. 2 contains separate curves of 
total errors per attempted anticipation to 
the C(3) figure, on the one hand, and to all 
three C(1) figures, on the other. Errors 
and attempted anticipations were summed 
over all 5$ for each block. Zero errors were 
assumed for Ss who had reached the cri- 
terion of two perfect blocks prior to a given 
block. It is apparent that most Ss learned 
the name of C(3) within the first two 
blocks and spent the remainder of their 
learning time in discriminating among the 
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Fic. 2. Total errors per attempted anticipation 
Over learning blocks for Ss in Group П-С. 


This would imply that for many Ss there 
was more immediacy and fewer cue-pro- 
ducing responses involved in the association 
between the C(3) stimulus and its naming 
response, at least at certain stages of learn- 
ing. Let us use an example to illustrate this 
argument. Suppose the paired associates for 
a given 5 in Group II-C are as follows: 
С, МОУ, C,—RUX, C;—KAL, and C,— 
GEP (see Fig. 1). Suppose, further, that 
С. occurs three times per block for this 5, 
and C,, С», and C, once each. Now this S 
quickly learns to respond “KAL” when C; 
is exposed. If this association is not im- 
mediate and such intermediate implicit re- 
sponses as, "triangle, wiggly line, line to 
left" occur, they quickly drop out with the 
overlearning concomitant with the frequent 
repetition of only one stimulus. The essen- 
tial body of learning consists of establishing 
discriminations among Cı, Cs, Ca. But this 
may require a good deal of emphasis and 
drilling on cue-producing responses. Thus, 
the learning may involve drilling on se- 
quences like: С, exposed—"circle in middle 
and line in left corner" —"WOY"; or C, 
exposed— "triangle in center and bottom 
line to right" —"GEP." If this process of 
discriminating among the C(1) with the 
help of implicit responses proceeded through 
the learning trials for these 55, the differ- 
ence in immediacy between responses to 
these stimuli and the response of “KAL” to 
С. would, up to a point, become more and 
more pronounced. When the Test Stimulus 
C is presented after that course of learning, 
a certain S might respond in some such 
fashion as: "No circle in the middle and 
no line in left corner, it is not a WOY ; no 
line in right corner, it is not a RUX ; no 
line in right corner and no triangle in cen- 
ter, not a СЕР.” The elements to which the 
cue-producing responses leading to the re- 
sponses of “WOY,” “RUX,” and “СЕР” 
are, in short, not present. The S then gives 
the response “KAL,” the name of the stim- 
ulus which appeared three times per block. 

With enough Ss of Group II-C behaving 
in accord with the above model, the dis- 
proportionately large number of C(3) 
names becomes understandable. Unfortu- 
nately there was no way of evaluating the 
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reasonableness of these formulations with 
the data thus far available and so new data 
had to be collected with proper experi- 
mental variation. 


EXPERIMENT III 


The purpose of this experiment was to 
test the hypothesis that the discrepancy be- 
tween obtained and expected response fre- 
quencies for the 5$ in Group II-C resulted 
from the unusual distinctiveness of the par- 
ticular figure occurring three times as often 
as the others per learning block. According 
to the analysis, this distinctiveness resulted 
in the 55 learning the more frequent figure- 
name association comparatively directly and 
the less frequent figure-name associations 
with the aid of intervening implicit re- 
sponses. And this in turn, presumably, led 
to a larger than expected frequency of the 
associated name of the C(3) figure on tests 
with Stimulus C. 

The method used in this experiment in- 
volved a replication of the conditions used 
for the Ss of Group II-C with one change 
introduced to reduce the distinctiveness of 
the figure which occurred three times per 
block. This diminution in distinctiveness 
Was accomplished by increasing the fre- 
Чиепсу per block of two of the other three 
figures, 


Method 


Subjects. Sixty-five Ss were used in this experi- 
ment. As in the previous experiments they were 
run individually and all were volunteers from 
introductory psychology sections. 

Procedure, The apparatus and stimulus and 

response materials were the same as those de- 
scribed previously. Except for the Specific compo- 
sition of each learning trial block, the procedure 
used for the Ss in Exp, III was identical to that 
used for the Ss of Group П-С. Whereas each 
block for the 5$ of Group II-C consisted of one 
C; presented three times and the other three once 
each, the blocks in the present experiment consisted 
of one C, presented three times, another presented 
once, and the remaining two figures presented twice 
each. The learning blocks for the Ss of Exp. III 
were, consequently, eight pairings in length. The 
particular figures among С, C, С, C, presented 
once, twice, and three times per Моск were chosen 
randomly for a given S, but once selected. the par- 
ticular figure frequencies were of Course main- 
tained through all of his learning blocks, 
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Fic. 3. Total errors per attempted anticipatio 
over learning blocks for Ss in Exp. ПТ. 
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Results апа Discussion 


The condition whereby two learning fig 
ures occurred twice each per learning ue Я 
surely would be expected to diminish ү" 
distinctiveness of the figure which occurre 
three times. That this occurred is indicat?" 
by the fact that 34% of the Ss made п 
C(3) errors throughout their learning үй 
blocks ; 37% made по C(3) errors on bloc lj 
two, three, or four; and 54% made поп 
on block two alone. (The corresponding 
percentages for the Ss in Group п-С к 
49, 59, and 66.) Fig. 3 shows (for the 1 
іп Exp. ПТ) curves of errors per attempt? 
anticipation to the C(3) figure, on the on 
hand, and the pooled C(2) and C(1) A 
ures, on the other hand. Ordinate value 
were determined as in Fig. 2. inc 
, Let us evaluate the effects of this dimi". 
ished distinctiveness. Table 8 contains th 


TABLE 8 


OBTAINED AND EXPECTED (IN PARENTHESES) 
RESPONSE FREQUENCIES TO Test STIMULUS 
FOR Ss IN Exp. ПІ 


C(3) Both C(2) са) 
25 31 9 
(24.4) (32.5) (8.1) 
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obtained and expected response frequencies 
of the Ss in Exp. III to Test Stimulus C. 
Responses consisting of the name of either 
of the two C(2) figures were necessarily 
pooled since, considering that the figures 
presented at each frequency were random 
for a given 5, there was no way of distin- 
guishing between them in a frequency anal- 
ysis over Ss. The expected frequencies 
were computed by the procedure described 
above. 

The closeness of the fit between expected 
values and obtained results is remarkable 
indeed. This supports the hypothesis that 
the discrepancy between obtained and ex- 
pected values for the Ss in Group II-C re- 
sulted from the unusual distinctiveness соп- 
comitant with the single figure's compara- 
tively large number of occurrences per 
block. This does not, of course, provide 
powerful confirmation to the steps of the 
argument wherein the unusual distinctive- 
ness is assumed to lead to the differential 
use of cue-producing responses and these 
їп turn to a preponderance of test responses 
Consisting of more frequent figure names. 
But there would seem to be no more com- 
peling line of reasoning available to ac- 
count for the discrepancy. Moreover, the 
presented line of reasoning was heuristic, 
and any other would be ex post facto. 

The accuracy of prediction for these 
data make it seem desirable to determine if 
the obtained results are consistent with the 
independence of expected outcomes and the 
effects of recency. Table 9 contains this 
analysis. А given entry for predicted pro- 
portion is simply the product of the prob- 
ability that the particular figure will occur 
last (or not last) during the final learning 
block and the probability, assuming the 
matching relationship holds, that the name 
of the figure will be given on the test trial. 
For example, the probability that C(3) is 
the last figure exposed is .375 and the prob- 
ability that it will be named according to 
the matching hypothesis is .375; hence, the 
Predicted proportion of C(3) responses 
When it is last is (.375)(.375) = .141. 
Again, the fit between predicted and ob- 
fained values is very close, indicating both 
the accuracy of prediction from the match- 
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TABLE 9 
OBTAINED AND EXPECTED PROPORTIONS OF 
RESPONSES IN RECENCY COMPARISON 
FOR Exp, III 
Test Stimuli 
C(3) Both C(2) C(1) 
Not- Not- Not- 
Last | Last | Last | Last | Last | Last 
Obtained -139 | .246 | .092 | .385 | .046 | .092 
Predicted .141 | .234 | .125 | .375 | .016 | .109 


ing hypothesis as well as the relative unim- 
portance of recency effects. 

Thus, the results of Exp. II and Exp. III 
taken jointly provide very convincing sup- 
port for the earlier finding of a direct rela- 
tionship between a learning variable and a 
perceptual variable. Moreover, these results 
provide a relatively clearly defined set of 
conditions under which the relationship 
breaks down. 


EXPERIMENT IV 


The purpose of the present experiment 
was to provide one test of the hypothesis 
that, in a recognition situation, discrimina- 
tive and common cues differ in the extent 
to which they are effective in determining 
responses because of differential sampling. 
Such differential sampling or, in other 
terms, selective attention has been hypoth- 
esized by certain theorists to account for 
perfect, or near perfect, discriminations in 
situations involving overlapping cues and 
common background stimulation. 

In their application of statistical learn- 
ing theory to the analysis of discrimination 
learning, for example, Estes and Burke 
(1955) found evidence for a reduction of 
oe elements in the stimulus situation 
over the course of learning. La g 
Smith (1957) explicitly eae dun 
the assumptions of statisti Е 
t of st tistical learning theory 
imply that an individual making perfect 
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discriminations іп two-choice learning 
should be selectively ignoring common ele- 
ments. They found support for the hypoth- 
esis that background common cues are ig- 
nored, in an experiment involving changes 
in partial reinforcement schedules with 
interspersed test trials consisting of only 
background elements. 


Wyckoff (1952) used the term “оБзегу- 
ing responses" to designate those responses 
of an S which are necessary for exposure 
to the discriminative stimuli, Wyckoff pre- 
sented an analysis of changes in the proba- 
bilities of these responses in terms of rein- 
forcement learning theory, equating these 
changes with alterations in stimulus gener- 
alization between the stimuli. Atkinson 
(1959) dealt with observing responses with- 
in the framework of a Markov model, in 
ап approach related to the earlier work of 
Wyckoff. He modified the typical discrimi- 
nation task significantly by equating ob- 
Serving responses with the operation of 
selection keys, in order to make the process 
of selective attention directly measurable. 
Restle (1955), on the other hand, used 
Lawrence's (1950) concept of "irrelevant 
Cues,” which are those aspects of a stim- 
ulating Situation which are unrelated to 
reinforcement. Restle hypothesized that 
these irrelevant cues become progressively 
adapted over the course of learning, and 
thus eventually cease to influence the dis- 
crimination responses of Ss. Recently, 
Restle (1957) has presented a more general 
theory of discrimination learning in which 
rate of adaptation is а function of the in- 
formation-giving value ("validity") of the 
available cues. Finally, Green (1958) de- 
scribed the process of selective sampling of 
stimulus elements in terms of a reduction 
of the intercept or common set as а result 
of postural or orientation adjustments. 

In summary, these investigators have at- 
tempted to account, by means of such con- 
cepts as "observing responses" and “сие 
adaptation," for the process whereby Ss 
learn to ignore the common (or nondis- 
criminative or irrelevant) cues in discrimi- 
nating among stimuli. The term "selective 
sampling" will be used to designate this 
process throughout the present discussion 


since the expressions "observing responses," 
"cue adaptation," and "cue neutralization 
carry connotations which are not intended. 
"Selective sampling" will imply no more 
than a designation of conditions, at a given 
discrimination point, whereby certain cues 
are effective in influencing responses while 
others are not. 

In order to test the hypothesis that s€- 
lective sampling occurs in recognition situ- 


ations of the sort investigated in the pre. 


ceding experiments of this project, use was 
made of the finding that the relative fre- 
quencies of responses to ambiguous cuen 
tend to equal the relative frequencies O 
occurrence of these responses in prone 
S-R associations. Current research woul 
indicate that selective sampling occurret 
in the preceding experiments, since the ma 
jority of Ss learned to discriminate amo 
the figures during the learning trials with 
no errors after the initial criterion of tW? 
perfect blocks was reached. The postulate 
in statistical learning theory (Estes ^ 
Burke, 1955), for example, that the proba 
bility of a response on a given trial is oars 
to the proportion of conditioned element? 
leads to the assumption that the ee 
elements in both the figures and the bac si 
ground were not sampled (that is, Weg 
ignored) in the process of response deter 


mination. d 

Paired-associates learning was again go 
in this experiment in the manner of n 
previous experiments. The present exper 
ment differed from the others, however * 
that the relative frequencies of presentation 
of the learning stimuli per tria] block We 
changed once during the course of learnins: 
The purpose of the shift was to introdu? 
à different rate of conditioning of the sti 
ulus elements. For those Ss for wh? t 
selective sampling had been operative 
the time of the shift, the ignored cues Wor 
not be conditioned at the new rate. A de. 
mination of whether common cues b 
responded to at the initial relative pr 
quency or at the relative frequency cp 
ing the shift then provided a means 0 У 
sessing if selective sampling had actu? 
occurred, 
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Method 


Apparatus. The exposure of stimuli and re- 
sponses was accomplished by the apparatus de- 
scribed in Exp. I. 

Subjects. Two groups of 38 Ss each were used 
in the experiment. They were volunteers from the 
introductory psychology sections at Indiana Uni- 
versity. 

Materials and procedure, Four learning figures 
and three test stimuli were used; these may be 
Seen in Fig. 4. А paired-associates procedure was 
used during the learning phase, in which the fig- 
ures shown in the left hand column of Fig. 4 were 


LEARNING FIGURES 


8) 


TEST FIGURE 


TEST FIGURES 


BE 


FiG. 4. Learning and test figures for Exp. IV, 


associated with the nonsense syllables RUX, TUZ, 
WOY, JOM. The matching of figures and sylla- 
bles was done randomly over Ss. The sound of 
a buzzer preceded each stimulus presentation, 

The timing of presentations and intertrial inter- 
vals was tlie same as that of Exp. I. 

For the first three trial blocks, each stimulus 
response pair was shown once per block. Then, 
assuming that a set of three learning blocks is a 
reasonable interval in which to expect selective 
Sampling to begin occurring, a differential fre- 
quency of presentation per block was used begin- 
ning with the fourth block and continuing through 
the learning phase. During these latter blocks, C; 
or C, occurred twice and the other once, while C, 
Or C: appeared four times and the other once. 
While C; and С; always appeared in the ratio of 
two to one per trial block, the particular one which 
occurred twice and the one which occurred once 
were randomly determined, In a similar fashion, 
whether С, ог C; appeared four times per block 
was randomly determined for different Ss. The 
Order of presentation was random on odd blocks 
and the reverse of the previous odd block on even 

locks, 

Upon achieving two trial blocks with perfect 
anticipations, the Ss were given a 3-min. rest pe- 
"lod. Then, during the test phase of the experi- 
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ment, they were required to achieve one further 
perfect trial block before each test trial The test 
trials consisted of the presentation of Є. Е’, and С" 
(see Fig. 4) to all Ss. Test Stimulus C' contains 
а cue set common only to Figures С, and Cs, Stim- 
ulus C" a cue set common only to C; and C, but 
Stimulus C contains nothing but cues common to 
all four learning figures. 

The two experimental groups differed in the 
ordering of the test trials. The Group IV-C' Ss 
were tested on either Stimulus C' or Stimulus C" 
first, the remaining one of these two stimuli sec- 
ond, and Test Stimulus C last. The Group IV-C 
Ss were shown C first, and then С’ and С” ran- 
domly ordered, second and third. 

For the learning instructions, the Ss in both 
groups were told that the nonsense Syllables were 
to be considered the names of the figures and that 
the syllables were to be pronounced in the process 
of anticipation. Prior to each test trial all Ss were 
told that a new figure would be presented and that 
their task was to respond with the one previously 
learned name which seemed most appropriate, 

It became apparent after these Ss were run that 
а control group was needed. This was needed be- 
cause no previous data were available for Ss tested 
on a stimulus like C after а 2:1:4:1 frequency 
throughout learning (that is, starting with 2:1 En 
during the first block and maintaining that fre- 
quency on all subsequent blocks). Accordingly, a 
group of 29 Ss was run with С, and С, shown in 
а 4:1 ratio and C; and Ci in a 2:1 ratio through- 
out learning (see Fig. 4). The test stimulus con- 
sisted of C alone. 


Results and Discussion 


If an S gives an appropriate response to 
Test Stimulus С” (that is, the name of C, 


or Cy), his subsequent response to С” must 
be different (that is, the name of C; or C,) 
or else it will be inappropriate. And the 
same holds, of course, for the sequence C” 
and then С’. In that sense, the responses 
to C’ and C" are necessarily independent 
for any analysis involving specific appro- 
priate responses, In contrast, the same re- 
sponses are appropriate to either С’ or ©”, 
on the one hand, and C, on the other, Thus, 
it is possible to have sequential depend- 
encies in terms of recurrences of the same 
response which is appropriate for both stim- 
uli. The previous experiments avoided the 
problem of possible lack of independence by 
using only those test stimuli (like С’ and 
C") in the Sroup administered to а given 
5, for Which the associated Sets of appro. 
priate responses were nonoverlapping, 
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The present investigation was so designed 
that the effects of using successive test 
stimuli for which the same responses were 
appropriate could be determined with little 
loss of experimental time. All Ss were 
shown the three stimuli C, C', and C" in 
such a fashion that C came either first or 
last. Thus, if it were found that the re- 
sponse patterning to the С’ and C", on the 
one hand, and C, on the other, are independ- 
ent, the combined data could be used; but 
if no such independence were shown, the 
data for the Ss seeing C first and those 
seeing it last could be analyzed separately, 
which would have been necessary even if 
all test stimuli had not been shown to each 
$ as in the present investigation of inde- 
pendence. 

Table 10 contains the response frequen- 
cies to the test stimuli for the Group IV-C’ 
and Group IV-C Ss. The response pattern- 
ing would seem to be quite different for the 
two groups, with a distinct tendency for 
lower frequency responses to occur to C 
when C’ or C" came first and to С’ and C^ 
when C came first. Thus, the ratio of more 
frequent to less frequent responses is 2:1 
for C" and 2.5:1 for С’ in the case of the 
Group IV-C’ Ss, and the corresponding ra- 
tios ате 1.5:1 and 1.7 :1 for the Group IV-C 
Ss. These data may be compared with those 
for Group П-АВ because of the similarity 
of learning frequencies and test conditions 
in the two situations. The ratio of more 

frequent to less frequent responses for the 
Ss in Group П-АВ is 22:1 to Test Stim- 
ulus A (comparable to C") and 3.8:1 to 
Test Stimulus Б (comparable to С’). This 
comparison adds some weight to the evi- 


dence of nonindependence in test responses 
when the same response is appropriate to 
successive test stimuli, despite the marked 
discrepancy in response tendencies between 
C' and B. It is surely clear that the response 
ratios to C’ and C" are much closer to the 
results for the Ss in Group П-АВ when С 
and C" appear prior to C than when they 
appear after C. A 

In the case of the responses to Test Stim- 
ulus C, there is an increase, when C comes 
last, in the responses which occur at fre- 
quencies of two and one at the expense О 
those of frequency four (see Table 10). 
The results from the group of 29 control 
Ss are relevant here. These were: 16 re- 
sponses associated previously with the fig- 
ure occurring four times, 7 responses which 
occurred twice, and a total of 6 of the two 
responses which occurred once each, These 
data are much more nearly like the results 
when C was presented first than when it 
was presented last. 

All the data from this experiment, as 
compared with the results for Group IT- 
and for the control group of 29 Ss, indicate 
that the learning blocks of Exp. IV sub- 
sequent to the third were not completely 
effective in overcoming the response а 
encies stemming from the three initial bloc А 
of equal frequency. That is, when the fou" 
learning figures are presented equally fre 
quently for three blocks and then shifte! 
to a 2:1:4:1 frequency, there is an increas 
in the relative frequencies during testing 
of naming responses of figures which od 
curred only once per learning block, ?" 
compared with the results when no shift 0^ 
curs and the frequency 2:1:4:1 is main 


TABLE 10 


RESPONSE FREQUENCIES (For Ex». IV) IN 
ASSOCIATED STIMULI 


TERMS OF THE LEARN 


ING FREQUENCIES OF 


с” ? 
Number First Test — E 1 
Group of Ss Stimulus 2 1 Error 4 1 Error 4 F | 
IV-C' 38 C' or C" 220 Ji g 20. 3$ 0 g. 17 | 
IV-C 38 С 1; i $ 2 B & |p 38 
Note.—Numbers in columns headed by C", Cr, 


appropriate. 


i 
re 
and C refer to frequency of occurrence, during learning, of named fig" 
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tained throughout learning. Moreover, as 
pointed out above, there is a further in- 
crease in the frequency of test responses 
consisting of the names of figures which 
occurred once per block when the test stim- 
ulus followed (for a given 5) other test 
stimuli having the same appropriate re- 
Sponses. This latter increase may have re- 
sulted from the attempts of Ss to use as 
many different testing responses as possi- 
ble. That is, there were obviously more 
high than low frequency responses initially 
апа so any general tendency toward using 
different responses on subsequent test trials 
would lead to a net shift in the direction of 
low frequency responses. MN 

In any event, as a result of the indicated 
lack of independence, only the C’ and C 
test responses of Group IV-C’ and the C re- 
sponses of Group IV-C were used in the 
analysis of selective sampling. ' | 

Table 11 contains the distribution of er- 
rors made subsequent to the first required 
criterion level of two blocks with perfect 
anticipations. It should be remembered that 
all Ss were required to achieve one further 
perfect block before each test trial. The 
deviation from perfect responding during 
criterion trials may conceivably have re- 
sulted from inadequate selective sampling 
of discriminative cues. Consequently, each 
of the two groups was divided into two 
parts: one part contained those Ss who 
made no errors whatsoever in the learning 
blocks following the first two required per- 
fect ones, and the other part contained Ss 
who made one or more errors during these 
blocks. For ease of designation, Group 


TABLE 11 


TOTAL ERRORS OF ANTICIPATION MADE 
BY Ss IN Exe. IV IN LEARNING BLOCKS 
FOLLOWING First PERFECT CONSECUTIVE PAIR 


Errors Number of Ss 
0 44 
1 15 
2 12 
3 2 
4 T 
5 1 
6 1 


ТУ-С-МЕ shall refer to those 5$ who were 
shown Stimulus C last and made no errors 
during the criterion blocks, Group IV-C’-E, 
to those Ss who were shown Stimulus C 
last and made one or more errors during the 
criterion blocks; and Group IV-C-NE and 
Group IV-C-E shall have similar meanings. 

The curves, over all learning blocks, of 
average number of errors per S in each of 
the four subgroups may be seen in Fig. 5. 
There is some difference in learning trend 
evident between the Group IV-C'-NE and 
Group IV-C'-E Ss, on the one hand, and 
the Group IV-C-NE and Group IV-C-E Ss, 
on the other. It is hardly surprising that 
the "NE" Ss as a group were more efficient 
learners than the "E" Ss. There would 
secm to be little difference between the 
learning rates of the Ss in Group IV-C/-NE 
and those in Group IV-C-NE during the 
later blocks, and these will be the critical 
comparison groups in the analyses of selec- 
tive sampling. The average number of 
blocks to reach the final necessary perfect 
criterion block for each of the subgroups 
is as follows: Group IV-C’-NE, 11.24; 
Group IV-C’-E, 12.61; Group IV-C-NE, 
10.58; Group IV-C-E, 12.95. The differ- 
ence between the Group IV-C'-NE and 
Group IV-C-NE Ss is small and far from 
significant. 


We would expect, if selective sampling 
occurs, that it does so for the Ss at high 
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learning levels who show perfect discrimi- 
nations over the criterion blocks. 

Table 12 contains a summary of the test 
responses of the various subgroups in terms 
of the frequencies of occurrence of the as- 
sociated learning figures. Responses con- 
sisting of the name of a figure which oc- 
curred twice or four times per block have 
been pooled under the "More Frequent" 
heading, while those consisting of a name 
occurring once per block have been pooled 
under “Less Frequent.” All erroneous test 
responses were eliminated from considera- 
tion. The pooling into these two categories 
was done because of the relatively small 
frequencies which would have resulted from 
the finer breakdown involving the actual 
numbers of learning occurrences. 

Expected frequencies for the pooled data 
can be computed by the usual rules for 
combining fractions. Thus, for example, 
the "More Frequent" responses should oc- 
cur three times as often as the “Less Fre- 
quent" responses if the Ss in Group IV-C- 
NE responded to the test stimuli at the 
same relative frequencies as the occurrence 
of the relevant figure-name pairs during 
later learning. If selective sampling had oc- 
curred prior to the fourth block, however, 
an equal number of C responses in the 

More _ Frequent” and "Less Frequent" 
categories would be predicted. That is, if 
the common cues were not sampled as a 
result of selective attention, they could not 
be conditioned to the various responses at 
the differing frequencies. 

Stimuli С’ and C" contain discriminative 
cues which could not be ignored if Ss were 
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to give correct anticipations during learn- 
ing. Hence, approximately a 2.82 to 1 ratio 
of “Моге Frequent" to "Less Frequent 
responses was expected for the Ss in Group 
IV-C'; the expectation is approximate rath- 
er than exact because of the previously 
noted ineffectiveness of the shift to 2:1 41 
in completely overcoming the response tend- 
encies stemming from the first three blocks 
of equal freqency. The test for the hypoth- 
esis of selective sampling involved A 
Group IV-C-NE Ss, since they responde 
to the test stimulus containing only common 
cues and showed perfect discriminations at 
criterion trials. Table 13 shows observed 
and expected values for the Ss in Group 
IV-C-NE and also for the 5$ in Group 
IV-C-NE. 

The observed and expected values do { 
differ significantly for the Group IV-C-N 1 
Ss (Р > 25) using a x? test. The observet 
values for the Group IV-C-NE Ss -— 
significantly from that expected if they dic 
not selectively sample (P = .05), but do not 
differ significantly from the values expectet 
under the assumption of selective sampling 
by the fourth block (P = 1.00). И 

It is interesting to note further (see 
Tables 12 and 13) that the Ss in Groub 
IV-C-E responded very close to the Ё 
learning ratio (14.25 to 4.75 expected VS: 
15 to 4 observed). It is surely clear that 
the Group IV-C-E Ss did not show selec? 
tive attention. It is conceivable that the", 
comparative learning inefficiency resulted 
from their deficiency of selective samplin? 
among common and discriminative CW” 
Noting the response ratios of the GrouP 


TABLE 12 


RESPONSE FREQUENCIES IN TERMS oF LEARNING FREQUENCIES OF ASSOCIATED STIMULI FOR EXP. IV 


Performance After 


р of 
Learning Occurrence iy 


1 man Numb » 
Group First Criterion Blocks of Se and oret вав 
IV-C'-NE No errors 
25 , 14 
IV-C'-E One or more errors 13 b. = En T 8 
IV-C-NE No errors 19 ES 19 9 
IV-C-E One or more errors 19 C i 4 


m —fs s 
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TABLE 13 


OBSERVED AND EXPECTED RESPONSE FREQUENCIES FOR Ss or Exp. IV 


Learning Occurrence of Stimulus-Response Pair 


Group Value Source More Frequent Less Frequent 
IV-C'-NE Observed | 33 P 14 
Expected (Differential Frequency Matching) 34.67 12.33 
Observed Г 10. 9 
IV-C-NE Expected (Selective Sampling) : 9.5 $5 
Expected (Differential Frequency Matching) 14.25 4.15 


IV-C-E and the Group IV-C-NE Ss in 
Table 12 (3.75 : 1 and 2.36 : 1, respec- 
tively), the relatively low number of “Моге 
Frequent" responses on the part of the 
IV-C'-E. Ss becomes difficult to account for 
on any basis other than sampling fluctua- 
tion. If the data for the Ss in Groups IV- 
C’-E and IV-C-E are combined, they give 
а total of 31 “Моге Frequent" and 12 "Less 
l'requent" responses. This is very much 
like the response ratio for the Group IV- 
C-NE Ss. 

То summarize, this experiment provided 
а test of the hypothesis that responses to 
ambiguous cues are a function of differen- 
tial attention to discriminative as compared 
to common stimulus elements during the 
process of learning the names of relevant 
stimulus figures. The test was based on the 
usual presumption that learning only oc- 
curs with cues actually sampled. This being 
the case, it was assumed possible to detect 
cessation in the sampling of certain cues by 
an initial learning experience followed by 
an altered learning experience subsequent 
to the assumed cessation. If selective sam- 
pling occurred to the given cues, test re- 
sponses to them at the end of the learning 
period should have been in accord with the 
initial learning; while, if selective sampling 
had not occurred, the responses should have 
been in accord with altered learning. 

Only Ss at high criterion levels of learn- 
ing were used since it appeared likely that 
lower levels of learning may have resulted 
from poor selective attention. It was found 
that Ss who were shown discriminative cues 
responded in accord with the later (altered) 
learning experience, while 5s shown only 


common cues responded in accord with the 
initial learning experience. This supported 
the hypothesis of selective sampling in per- 
ceptual-type situations involving common 
and discriminative cues. 

The data are in accord, moreover, with 
the findings of La Berge and Smith (1957) 
that a change in the partial reinforcement 
schedule associated with the common cues 
shown on test trials produces no change 
in the level of response for Ss near asymp- 
tote but does produce a change for 55 
further from asymptote. While the theory 
of Restle (1955, 1957) on the adaptation 
of common cues accounts for the general 
findings, it does not contain a sufficiently 
clear formulation of the relationship of 
adaptation to conditioning. Thus, the Ss 
in the present experiment responded quite 
accurately to the common cues which had 
presumably been previously adapted. Restle, 
to be sure, recognizes the phenomenon in 
his recent statement (1959) that "irrelevant 
cues are ‘neutralized’ relative to the relevant 
cues present, and recover if the relevant 
cues are removed or scrambled" (p. 14) as 
well as in his previous comment (1958) 
that, when certain cues are removed, invalid 
cues recover and performance tends to 
change. But it is not apparent just where 
these observations fit into his more formal 
theory of discriminative learning. 

It would seem more economical at the 
present time to consider the sampling and 
conditioning processes separately in accord 
with the formulations of Estes and Burke 
(1953). In this way a given stimulus ele- 
ment may be conditioned to a response but 
be ineffective in eliciting the response be- 
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cause it is not sampled. The probability of 
sampling the given element may be thought 
of as going to zero during the process of 
selective attention. When the stimulus situ- 
ation is changed sufficiently so that the ele- 
ment is again sampled, its previous condi- 
tioning is still effective. 


EXPERIMENT V 


In all of the experiments reported so far 
the Ss learned to associate nonsense names 
with nonsense figures which contained over- 
lapping cues. Following the learning, the 
Ss were required to name a test stimulus 
which contained only cues which were com- 
mon to two or more of the previous learn- 
ing figures. There was, in the terms used 
in the introduction, ambiguity as a result 
of reduction of information when the full 
set of cues was necessary for unique dis- 
crimination. Moreover, the test situation 
and analysis were devoid of phenomenologi- 
cal or “autistic” considerations. Surely it 
would not be easy to argue that an 5 who 
called a given test stimulus a RUX "really 
saw" the figure with which that name had 
been associated. Such phenomenological 
constructs could be anchored by other as- 
pects of the behavior of the 5 as well as by 
appropriate questioning. These aspects of 
perception will be discussed more f. ully later. 

Nevertheless, these considerations led to 
Xp. V. The test stimuli of Exp. V differ 
from those of Figs. 1 and 4 т that their 
ambiguity results from too many, rather 
than too few, cues; and in addition, there is 
probably more of a "really see" reaction 
associated with responses to them. Perhaps 
these should not be listed as two factors be- 
cause of their close interdependence, but it 
is convenient to do so to highlight the sepa- 
rate aspects. 


The learning and test stimuli used in this 
experiment were based on those employed 
by Schafer and Murphy (1943). These ex- 
perimenters hoped to demonstrate that per- 
ception was influenced by "needs." They 
attempted to show that, when Ss were re- 
quired to name an ambiguous presentation 
consisting of a combination of a Previously 
rewarded and previously punished profile, 


е) 


i 
. 
the latter would form the ground while the | 
former would stand out as figure. The 
stimuli used by Schafer and Murphy were 
identical to those in Fig. 6 except that each 
of their learning figures also included the 
eye dot of the profile that would be its com- 
pliment when it was presented as part of а 
test figure. 


TEST FIGURES. 


LEARNING. FIGURES, 
F 
Е Fo 


db © 


Fic. 6. Learning and test figures for Exp. уа 


In the Schafer and Murphy experiment 
Ss were shown the four half-moon p 
and were required to learn their names. ^R 
each S one face in each pair (such as ^! 
ог Е.) was regularly rewarded (2 OF 5 
cents given to S) while the other face ke 
regularly punished (2 or 4 cents taken fro 
S). The test stimuli of Fig. 6 which co” 
tain the combination of both figures О -e 
pair, one rewarded and one punished, wie 
then shown to the Ss under instructions р 
name them. Fifty-four rewarded as ОР 
posed to 13 punished names were given 
the test stimuli. 

Rock and Fleck (1950) repeated i 
Schafer and Murphy study but failed 
confirm the previous findings. However," 
pointed out by Jackson (1954), there BE 
several changes in the Rock and Fleck рс 
cedure that may have accounted for 
differences in results. Among the chanh р. 
were: younger Ss, the.use of symbols rat rd 
than proper names, the change in “rewe 
value” of 2 and 4 cents since 1943, a 
use of a projection tachistoscope whic 
sulted in larger faces, lessened clarity: ^ 


he 
to 


Tue EFFECTS or EXPERIMENTALLY CONTROLLED EXPERIENCE UPON RECOGNITION RESPONSES 23 


the absence of a constantly lighted field. 
Jackson repeated the Schafer and Murphy 
study with some modifications, including in- 
creased rewards, and once again showed 
that rewards and punishments had their 
predicted effects. However, when he sub- 
stituted a projector, for presentation pur- 
poses, with a second group of Ss, his find- 
ings concurred with those of Rock and 
Fleck. | 

Smith and Hochberg (1954), substituting 
15-in. solid figures for the outline profiles, 
demonstrated that a shock vs. no shock 
condition resulted in findings similar to 
those obtained for monetary reward and 
punishment. Ss gave a significantly larger 
number of nonshocked figure names when 
they were required to name ambiguous test 
figures. 

More recently Solley and Sommer 
(1957) replicated the Schafer and Murphy 
experiment using children, increased mone- 
tary rewards, and solid profiles. They ob- 
tained a significantly greater number of re- 
»warded names when their Ss were shown 
the ambiguous test presentations for one- 
min. periods and were asked to describe 
what they saw. In a second experiment, 
these authors found that Ss reported the 
rewarded profiles as "happier," "brighter," 
“closer,” and as having a dark line drawn 
around them. Solley and Sommer con- 
cluded that reward "produced a genuine 
figure-ground reorganization" (p. 9). 

Considering these results the purpose of 
this experiment may now be stated: (a) to 
test predictions of the relationship between 
relative frequency of S-R association and 
relative frequency of naming identification 
and (b) to compare the effects of fre- 
quency upon responses with those of re- 
wards and punishments. In addition, be- 
cause of certain modifications introduced 
in the present experiment, some informa- 
‘tion about the generality of previous find- 
ings might be established. 


Method 


Subjects. The Ss were 66 volunteers from the 
Courses in introductory psychology at Indiana 
University, The learning criteria were not met by 


two of these Ss and, therefore, the data írom 64 
Ss were included in the analyses. 

Apparatus. The apparatus employed was essen- 
tially the same as that described in Exp. IL The 
only important difference was in the size of the 
exposure mirrors, which were 3 in. by 3 in, in the 
present experiment. 

Learning and test materials. Each S was re- 
quired to learn the nonsense names of four half- 
moon faces. The four faces used for any S were 
either direct photographic reproductions of those 
faces employed by Schafer and Murphy with the 
eye dot exception as noted previously (see Fig. 6) 
or photographic reproductions of the negative 
images of these faces. This pool of eight figures 
was such that figures could be paired on the basis 
of shared common contours. The test figures con- 
sisted of these paired figures, Thus, if Е, Е. Gi, 
and С: were shown during the learning phase, the 
pairings F, and Е. to produce Е and С, and С. to 
produce G were the two test figures. 

The test figures were 224-in. photographic repro- 
ductions mounted on cardboard which, upon pres- 
entation, were scen centered in the appropriate 
window. А given face, whether it appeared indi- 
vidually or as part of a test figure, never varied 
from a set location of exposure. The nonsense 
syllables were printed in %-in. india ink capitals, 
on cardboard, such that on presentation they were 
seen as centered in their window. The four non- 
sense syllables used were WOY, JOM, RUX, and 
TUZ. 

Procedure. The procedure was similar to the 
one used in the previous experiments. However, 
the following intervals were used: exposure of 
stimulus 4 sec., exposure of response 2 secs., inter- 
val between trials 6 secs, and delay after warn- 
ing buzzer 1 sec. 

Eight trials constituted a learning block, On 
any block, one member of a pair containing com- 
mon contours appeared four times while the other 
member appeared once. The ratio of the frequen- 
cies of presentation of the members of the other 
pair was two to one. Ап example of the relative 
frequencies of presentation of the four faces for 
a given S may be seen in Table 14, 


TABLE 14 


NUMBER OF OCCURRENCES OF LEARNING FIGURES 
PER TRIAL BLOCK FOR ONE POSSIBLE 
ASSIGNMENT IN Exe. V 


Learning 


Figure Frequency 


n 
Fs 
Gı 
G: 


ewe 
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The order of presentation of the trials for each 
block was random. An attempt was made to con- 
trol for possible biases for particular faces and 
directions by the use of a counterbalanced design, 
such that for a given S the particular group of 
pairs used, the pair that was shown in the four 
to one ratio, and the member of higher frequency 
within each pair were randomly determined. 

The learning procedure continued until an 5 
anticipated perfectly on two consecutive trial 
blocks. After a rest period of 3 min., the 5 was 
told that following the next perfect block he 
would be shown a new figure and would be re- 
quired to give one of the names of the previously 
learned figures. Following this perfect block the 
S was reminded of his task and then shown a test 
stimulus, The test stimuli were shown for a dura- 
tion of 1% secs., beginning 1 sec. after the sound- 
ing of the warning buzzer. Aíter the first test re- 
sponse, the 5 was required to anticipate perfectly 
another block and was then shown the second test 
figure. The order of presentation of the test fig- 
ures was random, with the restriction that the 
figures corresponding to the 4:1 and 2:1 pairs 
were shown first an equal number of times. 


Results and Discussion 


Table 15 shows the observed and pre- 
dicted frequencies of responses to the test 
stimuli. The column headed by “Four” 
contains the number of obtained and pre- 
dicted responses designating the name of a 
face that appeared four times per learning 
block. The other columns correspondingly 
contain the number of responses designat- 
ing figures which were shown at the fre- 
quencies shown in the column heading. The 
errors are inappropriate responses. Refer- 
ring to Fig. 6, if the Test Figure G were 
shown, a response consisting of the name 
of Е, or Е, would be inappropriate. If the 
components of the Test Figure G had been 
presented in a four to one ratio, an error 
made to this figure would be represented in 


the third column, and if the ratio were two 
to one, in the last column. 

The predicted frequencies refer to the 
number of appropriate responses expected 
on the basis of the hypothesis that the rela- 
tive frequencies of appropriate responses to 
ambiguous cues are equal to the relative 
frequencies of presentation of the names 
associated with these cues during the learn- 
ing phase. 


The differences between expected and ob- 
served values (at least for the 4:1 compar. 
son) are quite large, especially as compare 
with the very close fits obtained in the previ- 
ous experiments. Using a X^ test, the au 
served and predicted frequencies differ at 
the .01 level in the 4:1 comparison and at 
the .20 level in the 2:1 comparison. The 
hypothesis of the matching of relative fre- 
quencies of responses and relative frequen” 
cies of occurrence during learning trials " 
this type of situation is certainly not We 
supported by these data. 

There are a number of differences be 
tween the present experimental conditions 
and the earlier ones in this project. First. 
the specific learning figures differ; secon’ 
the present figures are more meaning! 
than those used previously; third, the am 
biguity in the test stimuli resulted from an 
increase of cues in the present case, whi 
it resulted from an impoverishment of cues 
in the previous experiments; and fourt? 
while certain cues in the previous exper" 
ments were on a partial reinforcemen 
schedule, all cues in the present experime? 
received 100% reinforcement. There is ont 
other difference which, though less obviow® 
than any of the preceding, seems of utmos 
importance. The configurations of cues ! 


TABLE 15 


OBSERVED AND EXPECTED REsPONSES TO THE TE 


ST STIMULI IN Exp. V 


Appropriate Inappropri i iate 
paate Appropriate Inappropri? 
Learning ашы Responses Responses Response 
Figures Four One Two One 
Observed 40 21 
Predicted 48.8 12.2 36 7j cd 3 i 


* 
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TABLE 16 


NuMBER ОЕ Ss* SWITCHING AND REPEATING RESPONSES ON THE Two TEsTINGs IN Exp, V 


Second Second 
Response Response 
Frequency Frequency 
aes See | ae 
Row Row 
Four One Sum Two One Sum 
First Two 9 8 17 First Four 13 $ 21 
Response Response 
Frequency One 6 2 8 Frequency One 2 5 7 
Column Sum 15 10 Column Sum 15 13 


a Only Ss who gave two appropriate responses are included in analysis. 


the stimuli of the present experiment were 
such that the second test stimulus was ob- 
vious to an 5 once the first test stimulus 
had been seen. This was not the case in the 
previous experiments where the learning 
figures were nonsensical and the test con- 
figurations were arbitrary choices among 
the cues of these figures. Thus, while the 
first and second responses were independent 
in the previous work, the ability of an S to 
predict that the second compound face 
would be shown once the first had been ex- 
posed could lead to a lack of such inde- 
pendence. 

Table 16 divides the appropriate test re- 
sponses into categories defined on the basis 
of first and second responses. Thus, for 
example, it contains the number of Ss who 
gave the names of the learning figures of 
higher frequency on both trials, among 
those who were presented on the first test 
trial with the stimulus whose components 
occurred in a 4:1 ratio and on the second 
trial with the stimulus containing 2:1 com- 
ponents (that is, 13). The results for the 
other combinations are similarly included. 
The differences between response tendencies 
on the first and second trials are not statis- 
tically significant (using Fisher’s exact 
test). However, the very small sample 
sizes make it extremely unlikely that a sig- 
nificant difference would be found even if 
the obtained differences were much more 
extreme. Nevertheless, there are certain 
tendencies evident in the table: (a) a sur- 


prisingly large number of marginal re- 
sponses of low frequency on the second 
test trial, (b) Ss who give responses of low 
frequency on the first test trial to the com- 
pound face with components shown 4:1 
give more responses of low than of high 
frequency on the second test trial, (c) the 
relative frequencies on the second test trials 
tend to vary with the particular first test 
response given (though not significantly). 
A conclusion of nonindependence is con- 
sistent with the findings of Schafer and 
Murphy (1943). They encountered, in a 
pre-experimental phase, a tendency for Ss 
to respond, on repeated testings, with the 
names of faces pointing in a particular di- 
rection. They attempted to counteract this 
effect by inserting, between each test trial, 
à configuration with only one acceptable 
face which pointed in the direction opposite 
that of the face selected on the previous 
ambiguous trial. However, they found, de- 
spite this attempt at obtaining trial-to-trial 
independence, that after approximately 
eight exposures of an ambiguous figure, 
equal percentages of rewarded and punished 
faces were given. They accounted for this 
finding in terms of a "new set" and con- 
sequently only considered the responses to 
the first half of their test presentations. 
Pastore (1949) criticized Schafer and 
Murphy's analytical procedure since he felt 
that the cut-off point was а past hoc choice 
dictated by the fact that the hypothesis w 


as 
supported only up to the se 


lected point. 
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Chein, Lane, Murphy, Proshansky, and 
Schafer (1951) countered by saying that 
the choice of the sixteenth trial allowed a 
wide “margin of safety." They pointed out, 
further, that the responses aíter the six- 
teenth trial were not random, rather "there 
was clear evidence that another process in- 
truded and took over" (p. 134). 

Rock and Fleck (1950) referred to "a 
certain consistency which develops in re- 
sponding to a situation where there is no 
check on correctness or incorrectness after 
each trial" (p. 775). They suggested that 
"a better, though practically infeasible, pro- 
cedure would be to give, say, 100 55 each 
one trial with each ambiguous situation in 
order to avoid the contaminating effect of 
earlier presentations of ambiguous figures 
upon subsequent ones" (p. 775). 

On the basis of these observations, the 
first test response of each $ was considered 
separately for further analysis. Table 17 
contains the observed and predicted ге- 
sponses to the first test stimuli. The devia- 
tions of observed results from those pre- 
dicted are indeed small and very far from 
being statistically Significant. Thus, there 
15 some support for the hypothesis that 
even in a stimulus situation of the sort used 
in the Present experiment, the relative fre- 
quencies of the various first test responses 
to the ambiguous cues tend to be equal to 
the relative frequencies of occurrence of 
these responses to their associated cues dur- 
ing learning. This does not hold for the 
second test responses, apparently, because 
of a lack of independence. Since the orig- 
inal predictions were made for both test 
trials and the reasoning toward lack of inde- 
pendence made a posteriori, these results 
must be cross-validated. 


- 
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Considering the two testings separately, 
there are two types of biases that may have 
been operating to influence the responses 
given to the ambiguous figures. The first 
of these is a preference for giving the 
names of figures pointing in a particular 
direction. Jackson (1954) found a left- 
facing preference for the G figure and sub- 
stituted a new profile set in his replication 
of the Schafer and Murphy (1943) exper 
ment. Subsequently, he found no direc 
tional preferences. However, when he 
employed the Rock and Fleck (1950) pro- 
jection procedure he found a left-facing 
preference. In the present experiment к 
analysis of directional preferences потеки 
а left-facing tendency for Test Figure 
and its mirror image F’, and also for Te 
Figure G and its mirror image G’, althoug 
the preference was significant only in th 
former case (Р = .05). 

The second bias that might have bee? 
operating is a structural one: the naming 
of that face of the test figure that is more 
easily seen as a face, that is, “stands 
out more.” For example, if F, were more, 
readily seen as а face than F,, its name 
would be given more often independently ? 
learning presentation frequency and direc" 
tional considerations (right facing in M 
left facing in F’). In the present expe 
ment no such preferences appeared, as n 
of the test figure presentations produced ‘4 
significantly greater proportion of name? 
of a particular profile than could be e 
pected by chance (P > 30, P > .20 for 
and F', and С and G', respectively). 


In Summary of the work with figures D 
the Schafer-Murphy type a number We 
Investigations have demonstrated that рё 


TABLE 17 
OBSERVED AND EXPECTED RESPONSES TO First Test STIMULI IN Exp, V 
Appropriate Ina i i iat 
ppropriate Appropriate Inappropt 
Responses Respo 5 ui 

TN ponses Responses Respon: 
Figures Four One Two One 
Observed 22 T 17 j 7 
Predicted 23.2 5:8 16.7 à 3 
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ceptual responses could be predicted by an 
analysis of the previous learning situation 
in terms of rewards and punishments. The 
present analysis, however, not only enables 
us to predict a significant result in a given 
direction but also enables us to predict an 
exact quantitative relationship. And while 
this prediction held for the first testing re- 
sponses only in the present case, the pro- 
cedure of analysis involving the discarding 
of nonindependent data is the same as that 
used by Schafer and Murphy (1943) (who 
also discarded half their test) ; in fact, this 
mode of analysis conforms to the sugges- 
tions of Rock and Fleck (1950) who be- 
lieved this scheme would provide the most 
cogent test of the hypothesis under con- 
sideration. 

The evidence from these experiments, 
then, points toward the importance of fre- 
quency of S-R pairing during learning in 
determining responses to ambiguous cues. 
In fact, it indicates, under quite general 
conditions, а precise quantitative relation- 
Ship between the variables learning fre- 
quency of figures and response frequency to 
figure parts. But, as discussed above in the 
introduction, there are many other factors 
which apparently influence the choice of a 
particular response out of a larger group of 
equally accurate responses to a set of am- 
biguous cues. Some of these are food 
deprivation (Sanford, 1936; Sanford, 1937 ; 
Levine et al, 1942), monetary reward and 
punishment (Schafer & Murphy, 1943; 
Proshansky & Murphy, 1942; Solley & 
Sommer, 1957), electric shock (Smith & 
Hochberg, 1954), and pleasant and un- 
pleasant stories (Solley & Sommer, 1957). 
The remaining experiments in this project 
were directed toward investigation of a few 
of these types of differential reinforcement. 


EXPERIMENT VI 


Recent studies in operant verbal condi- 
tioning have shown that the response of 
“Good” on the part of the experimenter is 
effective in producing a significant increase 
in selected response categories. Thus, for 
example, Taffel (1955) found that the re- 
Sponse “Good” was an effective reinforcer 


in leading to a significant increase in the 
use of the particular pronouns with which 
it was associated. And Binder, McConnell, 
and Sjoholm (1957) found that “Good” 


. Was reinforcing in that it led to an increas- 


ing usage of “hostile” rather than “neutral” 
verbs in sentences. (For a thorough review 
of the literature on verbal conditioning see 
Krasner, 1958.) The conditioning effect 
in these studies was found even though the 
tasks were not presented as learning ones 
and the Ss were unaware of both the rein- 
forcement contingency and the actual learn- 
ing process. In the present series of experi- 
ments, interest was of course not centered 
in acquisition during the learning phase but 
rather in the choice among responses to re- 
duced cues subsequent to the achievement 
of criterion. The question was whether or 
not the experimenter’s response of “Good” 
would produce a test bias in favor of the 
figure-name pair with which it was previ- 
ously associated, much as it led to a bias 
toward a particular class of responses dur- 
ing the acquisition in verbal conditioning. 


Method 


Subjects. Twenty-nine Ss from the introductory 
psychology sections at Indiana University were 
used. 

Apparatus and materials. The same apparatus as 
that discussed in Exps. I, II, and III was used. 
The stimulus materials may be seen in Fig. 7. The 
dimensional and other specifications for the stim- 
uli may be found in the Methods section of Exp. I. 
The response syllables associated with the stimuli 
were JOM, VED, NAF, and RUX. 


LEARNING FIGURES 
EXPERIMENTS MAME 


TEST FIGURES 


TEST FIGURES 
EXPERIMENT УГ 


EXPERIMENT XI 


=. & ГЇ 
[о]; 


-earning and test figures for Exps. VI 


Fic. 7. I 
and VII. 
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Procedure. The figures and their associated non- 
sense syllables were shown to the Ss by the method 
of paired associates. Figures and names were 
paired randomly over 5$. Except for the experi- 
menter saying "Good" aíter certain of the Ss' re- 
sponses, the learning trials proceeded in the manner 
of the previous experiments. 

A learning block consisted of a single appearance 
of each stimulus and its response. The experi- 
menter said "Good" immediately aíter 5 gave the 
correct response to either А, or A:, on the one 
hand, and either Ci ог C; on the other. While the 
particular stimulus in each pair (that is, Аз or A: 
and C; or С:) associated with "Good" varied ran- 
domly over Ss, all Ss were reinforced for one 
member in each of these pairs. The reinforcement 
was given after the correct responses to the se- 
lected stimuli, irrespective of whether 5 had previ- 
ously learned the correct response or had to read 
it upon exposure in the right window. 

Initial criterion requirements consisted of two 
trial blocks of perfect anticipations, and then one 
more perfect block after a 5-min, rest period. 
The first test trial came next, followed by learn- 
ing trials to one more perfect block, and then the 
second test trial. Figures A’ and С’ (see Fig. 7) 
were shown during the testing, one (selected 
randomly) on the first test trial and the other on 
the second test trial. Ss were instructed to respond 
with the “most appropriate” name during the test- 


ing, but no reinforcement was given regardless of 
the actual choice. 


Results and Discussion 


The frequencies of the various responses 
to the test stimuli are shown in Table 18. 
The heading “Good” includes names given 
to the test stimuli which were associated 
With reinforcement during the preceding 
learning. The heading Not “Good” has 
similar meaning. There is 


imil n obviously no 
significant difference between 


the propor- 


TABLE 18 


FREQUENCY ОР OBTAINED TEST RESPONSES 
FOR THE 5$ IN EXPERIMENT VI 


Appropriate Inappropriate 
Responses Responses 
Not Not 
“Good” “Good” “Good” “Good” 
First Test 
Trial 12 13 4 0 
Second Test 
Trial 11 15 2 1 


tions of reinforced and nonreinforced те 
sponses within the category of appropriate 
responses. There is a slight increase in the 
number of Not “Good” responses relative 
to the number of "Good" responses from 
the first to the second test trials. This may 
be related to the experimental condition 
whereby no reinforcement was given during 
the testing. It is interesting to note that т 
six of the seven cases where an inappro- 
priate response was given it consisted of 4 
reinforced response. 

The results of this experiment are con- 
sistent with the statement (in a footnote) 
by Jackson (1954), “А third experiment 
substituting simple verbal ‘rewards’ (e€-$» 
‘good,’ ‘that’s right’) for monetary influ- 
ences was attempted, but results from EE 
subjects suggested that these were not sig 
nificantly effective in modifying figure 
ground perception” (p. 340). 

Although the experimenter's response 
"Good" apparently had no еЙесё on the 
choice of names in test responses to the 
ambiguous cues, the question arises а5 to 
whether or not it had any effect during the 
learning process. That is, was there faster 
learning of the figure-name pairs assot! 
ated with “Good” as compared with the 
figure-name pairs for which the expe" 
menter gave no response? Fig. 8 give$ ; 


о 
z 
g 
= 30 
ш 
u 
o 
& 20 
ae 
z 
2 
2 
a 10 
Е 
o 
E 
о 24 
2 4 6 8 о ра в |8 20 22 
LEARNING BLOCK E 
, Fic. 8 Total errors over learning blocks fof 
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learning curves of total number of inco. 
rect anticipations per learning block ety 
all Ss for both “Good” and Not ‘G0? 
figures. There is little basis to believe 
there is any difference in the efficiency as 
learning names reinforced by “Good 4. 
compared with names not so reinforc® 
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Of course, the learning situation under 
consideration is quite different from the 
situation in the typical verbal conditioning 
experiment, where all response classes are 
immediately evident and learning is meas- 
ured by an increase in the usage of one of 
these response classes at the expense of 
others. 

The evidence, then, indicates little effect 
of the experimenter's response of “Good” 
upon the determination of responses to am- 
biguous cues or upon the facility with which 
responses in paired-associates learning are 
acquired. Accordingly, in the next experi- 
ment attention was focused upon a type of 
stimulation which seemed more likely to 
produce a learning effect. 


EXPERIMENT VII 


Electric shock was used as punishment 
in this experiment. The effects of punish- 
ment upon the determination of responses 
to ambiguous cues have been studied by 
Schafer and Murphy (1943), Proshansky 
and Murphy (1942), Rock and Fleck 
(1950), and Jackson (1954). But in these 
experiments any effects of punishment were 
completely confounded with those of re- 
ward, since one set of cues in every am- 
biguous configuration was previously always 
associated with the award of money while 
the other set of cues in the ambiguous con- 
figuration was associated with the with- 
drawal of money. 

Smith and Hochberg (1954) used Scha- 
fer and Murphy type figures to investigate 
the effects of electric shock upon perceptual 
responses. Each face was exposed for $ 
sec. during learning trials, with one face in 
each reversible pair always associated with 
electric shock. As in previous studies, 63 
presentations of the reversible figures were 
shown during test trials. There was a sig- 
nificantly greater number of nonshocked 
than shocked test responses over their en- 
tire group of 20 Ss. Smith and Hochberg 
concluded that “. the ‘punishment’ 
(shock) procedure did affect the selection 
of the figure perceived" (p. 86). They con- 
Sidered their primary problem similar to 


that of Lazarus and McCleary (1951) who 
administered electric shock to Ss in associ- 
ation with certain nonsense syllables. In a 
series of tachistoscopic test trials, Lazarus 
and McCleary then found that the average 
GSR for previously shocked syllables was 
higher than for nonshocked syllables de- 
spite incorrect recognition responses. Smith 
and Hochberg, contrary to Lazarus and 
McCleary, found explanations which did 
not involve "subception" or "perceptual de- 
fense" more consistent with their data. 

In the Smith and Hochberg experiment, 
the electric shock accompanied presentation 
of the shocked face. This is not an entirely 
satisfactory procedure since the shock may 
Serve as a cue, since it is a part of the total 
stimulus configuration, and thus function 
like any other cue during the test trials. 
Accordingly, in the present experiment the 
electric shock was presented subsequent 
to the naming response of the S. To be 
sure, this is contrary to the efforts of 
Lazarus and McCleary “. . . to avoid associat- 
ing the shock with the subject's verbal report 
and thus influencing his response prefer- 
ence during the final test period" (p. 117). 
But in the present series of experiments 
interest was focused upon the responses 
made in recognition situations and the fac- 
tors which influence these responses, and 
not in evaluation of such perceptual con- 
structs as “autism,” “subception,” and 
“really see.” Specifically, the question to 
which the experiment was directed was 
whether or not electric shock leads to a 
decrease in the tendency to use the response 
previously associated with a stimulus con- 
figuration when the configuration tends to 
evoke that response as well as others. 

In order to assess the effects of differ- 
ences in the intensity and method of admin- 
istration of the electric shock, three groups 
of Ss were run. For one group, a fixed 
shock was used for all Ss; for the second 
group, the intensity of the shock admin- 
istered to а given S depended on his verbal 
reports; and, for the third group, an at- 
tempt was made to persuade each S to take 
a higher intensity of shock than he would 
have received if he were in the second 
group. 
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Method 


Subjecis. Al Ss were students of introductory 
psychology at Indiana University. A total of 92 
Ss were run, 21 in Group VII-FS, 29 in Group 
VII-VSI, and 42 in Group VII-VS2. 

Apparatus and materials. The apparatus for the 
controlled exposure of paired associates, as de- 
scribed in Exp. I, was used. The electric shock 
was administered by a constant current device, 
calibrated in milliamperes. The maximum possible 
shock was 10 milliamperes. І 

The learning figures and test stimuli used in this 
experiment may be seen in Fig. 7; the nonsense 
syllables КОХ, VED, JOM, and NAF were 
paired with them. The stimuli lettered A and C 
in Fig. 7 were shown during the test trials. 

Procedure. The Ss in all groups were presented 
with the figures and nonsense syllables as paired 
associates during learning. A learning block con- 
sisted of the presentation of each figure-name 
pair once. Electric shock was administered to all 
Ss during the learning trials in association with 
either А; or Аз and its response, and also in asso- 
ciation with either C; ог C; and its response. The 
particular member within each of these pairs which 
was shocked was selected randomly over Ss. 

All Ss were told that “a very mild electric 
shock” would be administered and were given a 
chance to leave if they objected to the shock for 
either medical or personal reasons, An effort was 
made to dissuade from leaving those Ss whose 
objections seemed based on anxieties very little 
above the general public’s fear of electricity. 
Nevertheless, about 5% of the total available set 
of Ss were lost at this stage. 

All Ss in Group VII-FS were given a shock 
of six milliamperes. The intensity of the shock 
ee in Group VII-VS1 was set at that 

wi ich produced the verbal report indicating 
а pain threshold. Thus, after the electrodes were 
adjusted on the wrists of the Ss and before the 
experiment was started, the following instructions 
were read: 


I would like to establish the intensity of the 
electric shock that will be administered during 
the course of the experiment, As I gradually 
increase the current intensity through а series 
of trials, I would like you to tell me the level at 
which you find the shock “just painful.” Before 
each trial, I will say, “Ready” and then admin- 
ister the electric shock. Remember, I want you 
to tell me when the shock is "just painful,” 


The first level of shock administered was zero 
milliamperes, the second 2 milliamperes, the third 
4 milliamperes, and so on through 6, 7, 8, 9. and 
10 milliamperes. The series of increases’ was 
stopped either at 10 milliamperes or whenever the 
S reported pain. 

An attempt was made to get the Ss in Group 
VII-VS2 to accept a current somewhat above the 
“just painful” level for the Ss of Group VII-VS1. 
The preliminary instructions, after the electrodes 


were fastened, for these Ss were like those 
reported above for the Ss in Group VII-VSl 
However, the first shock administered was two 
milliamperes, and this was increased by two mili- 
ampere steps to 10 milliamperes. Additional com- 
ments were made during the succession of increases 
in order to make each S a little less self-conscious 
during the process. Thus, prior to the two mills 
ampere shock, E said, "Here is the lowest shoc 
level. Usually people do not feel anything at the 
first shock intensity. Ready?" And after this 
shock was administered, E asked, "Did you feci 
anything?" Such comments as, "Now the secon’ 
level,” "Feel anything?" and "How was that 
were made during the subsequent course of inten- 
sity determination. When a level of current hd 
reached after which the S reported that he cou 
take no more, or that the shock was too paim E 
or that he wanted to stop, E said, "May I БЕ 
just a little higher? If it is too high I would i 
glad to turn it back down.” After obtaining раг. 
mission, the current was increased by one m! 
ampere. Four Ss refused to try the increase, ала 
only three insisted on the reduction after the 1" 
crease was made. t 
Learning proceeded to a criterion of two pere 
blocks, a 5-min. rest period, and then ао 
block of perfect anticipations. Two test E. 
were then given with the trials necessary for е 
more perfect block coming between them. On yi 
test trials both Stimuli A and C were shown (see 
Fig. 7), in random order, to all Ss under instr 
tions to respond with the most appropriate пате. ^ 
At the conclusion of the experiment all Ss wer 


asked to state why they gave their particular r% 
sponses. 


Results and Discussion 


The mean shock intensities for the 5$ 15 
Groups VII-VS1 and VII-VS2 were 8: 
and 9.1 milliamperes, respectively. The c2 
forts to persuade the Ss in Group VILVS 
to accept higher levels of shock were ар” 
parently effective, 

The response frequencies to the test St 
uli in terms of Previous association with 
shock or not-shock may be seen in Ta" 
19. The responses are grouped into “ар 
propriate" and "inappropriate" categorie 
It is perhaps most useful to consider i? 
tially the consistent pattern of alteratio 
over all subject groups from first to seco” 
test response. In each case there is а Cle 
shift toward more shocked responses an 
fewer not-shocked Tesponses. Since 
Shock was administered during the 


0 
test 


e 
trials, it would seem reasonable to assu 
that this absence is the basis for the alte" 
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TABLE 19 


FREQUENCY OF OBTAINED TEST RESPONSES 
FOR THE Ss IN Exp. VII 


Inappropriate 
Responses 


Appropriate 
Responses 


Not Not 
Shocked Shocked Shocked Shocked 


Gp. VII-FS 
First Test 9 10 1 1 
Second Test 12 8 0 1 
Gp. VII-VS1 
First Test 5 21 2 1 
Second Test 11 18 0 0 
Gp. VII-VS2 
First Test 14 23 3 2 
Second Test 19 18 4 1 


tion. This is consistent with the findings 
of Exp. VI. 

Accordingly, primary attention was paid 
to the responses to the first test stimulus 
for each S in determining the effects of the 
previous shock upon responses to the am- 
biguous cues. It is clear that the shock of 
fixed intensity had no significant effect upon 
the responses to the first test stimuli by the 
5$ of Group VII-FS. The deviation from 
equal response fregencies for the Ss in 
Group VII-VSI, on the other hand, is sig- 
nificant at the .01 level (using the binomial 
test). In contrast, the deviation from 
chance responding for the Ss in Group 
VII-VS2 is not significant (using the x? 
approximation since the cell frequencies are 
sufficiently large). To be sure the discrep- 
ancy goes in the right direction in that more 
5$ gave a not-shocked response than a 
Shocked response. But the ratio in that di- 
rection was only about 1.5 to 1 where it was 
more than 4 to 1 for Group VII-VS1. 
Assuming that the difference between 
groups is not merely chance, the evidence 
thus indicates that the extra efforts of Е 
to persuade the Ss to accept a higher level 
of shock decreased rather than increased 
the effect of the shock. 

How can one account for this seemingly 
odd result? First, it must be remembered 
that seven Ss did not accept the one milli- 
ampere increase in shock intensity during 
the shock adjusting trials. Presumably these 


seven Ss were among those for whom the 
shock was most upsetting. To a certain 
extent at least, then, there may have been 
a tendency for the increase in average shock 
intensity for Group VII-VS2 as compared 
with Group VII-VS1 (9.1 vs. 8.2 milliam- 
peres) to be associated with Ss who were 
least disturbed by shock. 


One other factor must also be considered. 
This is related to the finding that differ- 
ences in the personalities of experimenters 
can produce differential learning effects in 
verbal conditioning (Binder et al., 1958). 
While the same E ran all three groups of Ss 
in the present case, we note the slight in- 
crease in participation of E with Group 
VII-VS2 as compared with Group VII- 
VSI. While the goal was to make each 5 
less self-conscious of the shock during the 
preliminary attempt to fix the maximum 
possible intensity, E's increased responsive- 
ness may have had a kind of psychothera- 
peutic result in ameliorating the painful 
effects of the shock. 

The comments of the Ss at the conclusion 
of the experiment as to their reasons for 
giving their responses are of some incidental 
interest. These comments indicate that the 
test responses were generally not "inter- 
pretive" in the sense that some writers on 
perception use the term: that is, there was 
no concomitant implicit relating of shock 
and response alternatives prior to naming 
the ambiguous stimulus. Thirteen of the 
total of 92 Ss in Exp. VII related their 
choice of test responses to the previous 
shock during learning, and, of these, five 
actually gave shocked responses to the am- 
biguous cues. Among the stated reasons for 
test response choice are the following 
quotes, which are slightly paraphrased—the 
numbers and symbols following each quote 
indicate whether the S who made the par- 
ticular statement gave two shocked (2S), 
two not-shocked (2NS), or one shocked 
and one not-shocked (1S, INS) responses: 
Remembered them more strongly (2S) 

First two learned (2NS) 

Liked those stimulus words bett 
(2NS) 

Paid no attention to the shock (15, INS) 

Looked more alike (1S, INS) 


Easier to remember, especially TOM (1S INS) 


er, sounded prettier 
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Didn't want to get shocked again (2NS) 

RUX always shocked and used to saying it (2NS) 

First thought of (1S, INS) 

Both circles, means nothing, so like a blank space 
(2NS) 

Knew it best, first one should have been JOM, 
I guess, because of jagged line (2NS) 

Just stuck in my mind (2S) 

Both were circles and therefore easier to remem- 
ber (1S, 1NS) 

Because these were ones shocked (2S) 

Most 5$ stated that there was no reason 

for their particular choices, and next in 

order of frequency were statements indicat- 

ing ease of remembering and priority of 

learning. Interestingly enough, and in ac- 

cord with the findings in the area of verbal 

conditioning, most Ss indicated during the 

conversation following the experiment that 

they were not aware of the particular pairs 

that were punished. 


The results of this experiment, then, in- 
dicate that, when Ss are confronted with an 
ambiguous situation in which a previously 
shocked and a Previously unshocked name 
are appropriate, they will tend to respond 
with the unshocked name, This conclusion 
stems not only from the significant differ- 
ence for Group VII-VS1, but also from 
the consistency of results (though insig- 
nificant) over all groups. 

The effects of the shock upon acquisition 
during the paired associates learning trials 
may be seen in Figs. 9, 10, and 11. (The 
Curves for Group VII-VS2 are somewhat 
curtailed since one individual made one 
error to a shocked stimulus in Block 19 and 
two errors to a not-shocked stimulus in 
Block 21.) Particularly noticeable is the 
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Fic, 10. Total errors over learning blocks fof 
Ss in Group VII-VSI. 


consistent tendency for more errors to үз 
associated with the shocked than the p 
shocked stimuli from about the third to eS 
eighth block. After approximately he 
eighth block (but varying between t i 
seventh and tenth blocks for the differen! 
groups) there does not seem to be any COT 
sistent pattern. Despite the general ёш 
sistency of these intervals in which шш. 
errors occur to the shocked than the not 
shocked stimuli, there are obvious di ed 
ences among the intervals. And these ас, 
interestingly, related to the difference 
among the groups in the ratio Берне, 
shocked and not-shocked responses tO in, 
test stimuli. Thus, the length of the inter 
for Group VII-VS1 is approximately Se 
blocks; that for VII-VS2, six blocks; ай 
that for УП-Г$, four blocks. 
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Fic. 11. Total errors over learning blocks 
Ss in Group VII-VS2. 
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A final comparison of some interest in- 
volves the proportion of errors, subsequent 
to the initial criterion of two perfect blocks, 
which were associated with shocked and 
not-shocked stimuli, respectively. These 
proportions and the frequencies which they 
represent may be seen in Table 20. The 
over-all difference of 38 shocked as against 
19 not-shocked errors is significant at the 
02 level (using x°). 

Thus, the electrical shock seemed to have 
some effect in interfering with the learning 
of the stimulus-response connections with 
which it was associated, as well as leading 
to a difference in response tendencies to 
ambiguous cues. There is no compelling 
evidence indicating that the testing differ- 
ences are accountable solely on the basis 
of the learning differences, nor the con- 
trary, for that matter. 


OvER-ALL DISCUSSION AND CONCLUSIONS 


An immediate question that arises with 
respect to the frequency results is whether 
they are accountable on the basis of the 
development of response biases, rather than 
on the basis of recognition responses asso- 
ciated with particular stimuli in the previ- 
ous learning trials. In other words, would 
comparable results have been obtained if 
blank cards had been shown during the test 
trials in place of the stimuli of reduced 
cues? Goldiamond and Hawkins (1958), 
for example, have shown that, under cer- 
tain restrictive conditions, the relationship 
between word frequency and tachistoscopic 
threshold can be explained in terms of re- 
sponse biases. In their experiment, after a 


TABLE 20 
PROPORTIONS OF ERRORS AFTER INITIAL CRITERION 
IN Exe. VII 
Not 
Shocked Shocked 
Gp. VII-FS 70% (7) 30% (3) 
Gp. VII-VS1 66.795 (6) 33.3% (3) 
Gp. VII-VS2 65.8% (25) 34.2% (13) 
Total 66.7% (38) 33.3% (19) 


Note.—Values in parentheses give actual numbers, 


series of training trials involving the pres- 
entation of nonsense syllables at differential 
irequencies, 5$ were shown blank flashes 
at tachistoscopic speeds under instructions 
to respond with one of the previously 
learned syllables on each trial. A simulated 
method of limits was used. Goldiamond 
and Hawkins suggested that their results 
indicating the operation of response biases 
are generalizable to other types of studies 
establishing relationships between frequency 
and recognition thresholds. 

There is evidence to indicate that in the 
present experiments the important fre- 
quency variable in determining test re- 
sponses is the relative frequency of occur- 
rence of a given stimulus under conditions 
in which the response in question is appro- 
priate, and not merely the relative frequency 
of that response during the learning trials. 
First, there are the very few inappropriate 
responses during the test trials, which in- 
dicates that the Ss were certainly attentive 
to the test stimuli. Second, the situations 
in which the relationship between relative 
frequencies during learning and relative 
irequencies among responses to ambiguous 
cues did not hold provide further evidence. 
Consider the Уз in Group II-C and those 
in Exp. ПТ: the relationship held with the 
latter but not with the former. Since the 
only difference in experimental conditions 
between the two groups was in the relative 
frequencies during learning (3:1:1:1 vs. 
3:2:2:1), it is difficult indeed to see how 
the discrepancy in results between the two 
groups of Ss can be explained by response 
biases, without stimulus considerations. 
(Stimulus considerations were of course 
essential to the explanation offered earlier 
for the discrepancy.) In the case of the 
Groups I-U and П-АВ to Test Stimulus A, 
the discrepancy in results was associated 
with the presence and absence of a supple- 
mentary set of figures and names (the Сг) 
during learning. The source of the difficult 
seemed to be a stimulus confusion between 
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crepancy. The C; figures and their names 
are somehow involved, and it is not clear 
how their presence or absence can affect 
response biases. 

The third line of evidence indicating that 
the results of the analysis of frequency in 
recognition are not explainable in terms of 
response biases stems from considerations 
of the actual frequencies of the various 
responses during the learning trials of the 
Ss in Exp. III. These particular Ss were 
chosen since all possible test responses were 
appropriate for them and they showed al- 
most perfect correspondence between learn- 
ing and test relative frequencies. Table 21 
Shows the number of erroneous responses, 
for each block of the paired-associates 
trials, consisting of names occurring three 
times, twice, and once per block, respec- 
tively. When an erroneous anticipation was 
of a form which was not clearly classifiable 
in terms of frequency, it was not included 
in the analysis. Examples of responses so 
excluded are GAF (derived from GEP and 
NAF), TUX (derived from TUZ and 
RUX), and VAL (derived from VED and 
KAL). In about 25% of the cases of errors 
in anticipation, the Ss gave no responses 
prior to the time the name appeared in the 


TABLE 21 


CLASSIFIABLE ERRONEOUS ANTICIPATIONS DURING 
LEARNING BLocks or Ss Іх Exe. III 


Frequency of S-R Pair 


Block 3 2 1 
2 14 65 31 
3 I3 45 32 
4 10 29 15 
5 3 19 12 
6 3 11 6 
7 3 9° 3 
8 0 3 1 
9 1 3 3 
10 0 2 0 
11 0 1 0 
12 0 1 0 
Total, 
Blocks 2-4 37 139 78 
(0.48) (1.78) (1.00) 
Total, 
Blocks 5-12 10 49 25 
(0.40) (1.96) (1.00) 


right window; these could of course not be 
considered in this tabulation. 

ТЕ the differential frequencies of test Te 
sults are explainable as response biases, 
these biases have surely developed during 
the learning trials and should be evident 1n 
the response tendencies during learning: 
Thus, whenever an erroneous anticipation 
occurs, one would expect that it is more 
likely to be a high frequency than a E 
frequency name and that this tendency WI 
increase over the learning trials. The for- 
mer is difficult to evaluate because of the 
complication that an 5 who responds to à 
stimulus figure in accord with a response 
bias and independently of the actual sti 
ulus is more likely to be correct when у 
gives a high frequency rather than a Ion 
frequency name. But, despite this com 
plication, if response biases are operat 
the trend over learning blocks should be 
apparent in the proportions of erroneous 
anticipations for the S-R pairs of differing 
frequency. Table 21 shows no such tren! à 
The numbers have been summed over bloc 
two to four as well as over all blocks from 
five on to facilitate this evaluation furtis 
These totals are also included in Table 21; 
the parenthetical values are the totals E. 
terms of the number in the right hand eg 
umn as a unit. There is surely no evidenc 
for the operation of response biases b 
these data. а 

As а final indication of the important 
of stimulus factors in determining test 16 
sponses we have the data on selective 52” 
pling. The Ss in Group IV-C-NE we 
those in IV-C-NE differ markedly in thes 
proportions of various test responses, ye 
both groups gave the same average numbe j 
of all of the various responses during leat, 
ing. Similar comparisons may be mos 
among other groups in Exp. IV that ae 
fered in either specific test stimulus in 
errors during criterion trials, but nof, g 


response frequencies during the leaf? 
trials. 


m- 
he 


e 

Turning now to another aspect of a 
frequency analyses of responses to a” fe 
uous cues: it is perhaps not unreason@ m 
to assume on the basis of the data Иа 
groups that an individual's probability 0 
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given recognition response to a set of am- 
biguous cues will. be equal to the relative 
frequency of his previous S-R experiences 
with the named stimulus object. 

The findings are especially interesting 
when consideration is given to the deduc- 
tions and empirical evidence stemming from 
statistical learning theory. Estes and 
Straughan (1954) used an experimental 
Situation involving a given set of stimulus 
elements and two response classes, with 
respective probabilities m and 1-7 of being 
reinforced. They demonstrated mathemati- 
cally and empirically that asymptotic prob- 
ability of response is equal to probability of 
reinforcement under their specified condi- 
tions. Burke and Estes (1957) later showed 
that statistical learning theory predicts that 
response probability approaches reinforce- 
ment probability regardless of the distribu- 
tion of sampling parameters (Estes and 
Straughan had assumed them equal for all 
elements). Finally, Estes (1957b) has pre- 
sented the most general asymptotic match- 
ing formulation, showing that cumulative 
response proportion tends toward cumula- 
tive reinforcement proportion no matter 
how the latter varies over a series of trials. 
Moreover, in this presentation Estes allowed 
any number of alternative response classes. 

Since the Ss in the present experiments 
were required to correct all wrong anticipa- 
tions upon exposure of the response sylla- 
bles, the present experiments satisfied the 
necessary conditions of statistical learning 
theory that each possible response must 
have an associated reinforcement which, 
when present on a given trial, assures that 
the response will terminate the trial. Using 
the terminology of the statistical learning 
theorists (see Estes, 1957b, for example), 
the nonsense names appearing in the right 
windows of the present experiments cor- 
respond to the set of reinforcing events 
(E; Ez ..., E,), the naming responses 
of the Ss to the set of alternative responses 
(Ay, Аз, ..., Ау), and the relative fre- 
quency of occurrence per trial block for the 
various stimulus-response pairs to the rein- 
forcement probabilities (71, Ta, . . . , 7, ). 

Working within the framework of sta- 
tistical learning theory, Peterson (1956) 


used an anticipatory technique to establish 
two verbal response hierarchies by associat- 
ing responses consisting of English nouns 
with pairs of nonsense syllables used as 
stimuli. One of the syllables in each non- 
sense stimulus pair was common to another 
stimulus pair and thus conditioned to two 
noun responses, while the other syllable was 
unique and conditioned to a single response. 
He found, in word association test trials 
involving the presentation of the ambiguous 
nonsense syllables (those common to two 
noun responses), that the observed relative 
frequency of response was equal to rein- 
forcement probability (as corrected for the 
slight dissimilarity between learning and 
testing situations). 

Schoeffler (1954) used test patterns com- 
posed partly of stimuli to which a given re- 
sponse had been conditioned, partly of stim- 
uli to which a competing response had been 
conditioned, and partly of stimuli to which 
neither the given nor the competing response 
had been conditioned. His stimuli consisted 
of light patterns selected from a group of 
24 lights, and his responses of movements 
of a lever to the right or left. He found 
that the probability of a response to a test 
pattern is equal to the proportion of stim- 
ulus elements in the test set that are condi- 
tioned to the response. It should be noted 
that Schoeffler produced ambiguity in his 
test stimuli by presentations containing too 
many cues. 

The finding in the present experiments of 
a close correspondence between relative fre- 
quencies of presentation during learning 
(reinforcement probabilities) and relative 
frequencies of test responses provides gen- 
eral support for the asymptotic expecta- 
tions based on statistical learning theory. 
However, it must be noted for the experi- 
ment involving the faces (Exp. У) ‘that, 
since all cues were given 100% reinforce. 
ment, the analyses of Estes and Straughan 
(1954), etc. do not lead necessarily to the 
prediction of matching of test response and 
learning relative frequencies—that is, 
assumes a. high degree of selective за 
such that the common eleme 
paratively low probability 
Deviations from 


if one 
mpling 
nts have а com- 
v of being sampled. 
matching would be ey. 
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pected to be in the direction of equal fre- 
quencies between the two alternatives 
(which is in accord with the results shown 
in Table 15). But the status of the common 
elements in this particular experiment is 
not at all clear because of their central, 
attention-getting characteristics. 


Let us consider the other general result, 
namely, that Ss who learn to make discrimi- 
nating responses indicating recognition on 
the basis of configurations can make accu- 
rate discriminations using parts of these 
configurations. In only one of the six 
comparisons in Exp. I did the frequencies 
of appropriate and inappropriate responses 
to the test stimuli differ significantly from 
the frequencies expected purely as а result 
of the 5s not having achieved perfect learn- 
ing levels. Moreover, the results of the sub- 
Sequent experiments seemed to confirm this 
general result. 


"These results of appropriate and inappro- 
priate response frequency have important 
implications for conceptualization in statis- 
tical learning theory. They indicate the 
limitations of the approach involving com- 
ponent elements where stimulus events or 
elements are equated with cues and no al- 
lowance is made for the interaction of cues 
or the discriminations based upon configura- 
tions. Thus, a component approach would, 
in the present experiments, involve identi- 
fying various cues (circle or triangle, wavy 
or straight line at top, slanted line in right 
or left corner as in Fig. 1) or their parts 
as stimulus elements without assigning 
Stimulus elements to patterning effects 
( such as a triangle with а wavy line above 
it). The stimulus elements associated with 
the pattern would be the stimulus elements 
identified separately with the components 
of the pattern and nothing more. 

Estes (1957a) has questioned the ade- 
quacy of the component approach in sta- 
tistical learning theory and pointed out the 
improvement in predictions resulting from 
considerations of pattern or configuration. 
In terms of the present experiment, statis- 
tical analysis of the stimulating situation 
based upon components leads to the pre- 
diction of many more inappropriate re- 
sponses than were actually obtained, Let us 


demonstrate this under the assumptions that 
all stimulus elements had been perfectly 
conditioned prior to test trials and that the 
Ss had learned to sample selectively early 
in learning. The present argument woul 
be even stronger (though more difficult to 
present) without these assumptions. . 
Consider the stimulus elements associ- 
ated with exposure of Test Figure А (see 
Fig. 1) in accord with a component-element 
approach, using Group I-E as an example. 
One-fourth of the stimulus events sampled 
in observing the triangle have been con- 
nected to the name of Figure А,. One- 
fourth of the events have been connected to 
the name of А», another fourth to the name 
of Сз, and the final fourth to С.. Similarly, 
the stimulus events associated with the line 
above the triangle in A have been evenly 
conditioned among the responses to Ai, A», 
Bi, and Bs. There would seem to be no 
basis for assuming that events connected 
with the names of A, and А» are any more 
likely to be sampled on a given testing trial 
than other events. Thus, since the proba- 
bility of a given response is considered equal 
to the proportion of stimulus events in the 
population which are conditioned to the re- 
sponse, the probability of the name of A, 
to A is 1; the name of As, 1; either the 
name of B; or the name of Су, +; and either 
the name of B; or the name of C,, i. This 
leads to the expectation of 50% appropri- 
ate and 50% inappropriate responses to 
Test Stimulus A. A similar analysis may 
be made for Test Stimulus В for the Ss in 
Group LE. A comparable analysis for the 
Ss in Group I-U involves Some complica- 
tions due to the unequal trial-block fre- 


quencies, but can be expected to produce 
similar results, 


Thus, some of the results which would be 


predicted on the basis of a Stimulus-com- 
ponent approach to Statistical learning 
theory are at considerable variance with 
the obtained results. This clearly indicates 
the desirability of component interaction OF 
in other words, patterning considerations 
| Тһе Same data also indicate the dangers 
In going too far in configurational emphasis 
The findings on appropriate and inappro" 
priate responses raise questions for the 
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gestalt interpretation of the phenomena of 
recognition. Thus, Koffka (1935) states: 


In applying the law of similarity to the selection 
of a trace by a process one has to exercise great 
care. One must bear in mind that similarity must 
exist between, and therefore be defined in terms 
of, the process and the trace; hence similarity or 


partial identity between the two sets of stimuli 
which give rise to the formation of the trace and 
the process now occurring is лоѓ an adequate 
criterion for the application of the law. And there- 
fore any explanation like the following one is 
essentially wrong: If an organism has reacted in 
a certain way to the stimulus complex A В С 
D ЕЕ, it will later respond in the same way to the 
recurrence of a part of this complex, say B С; 
. . . We know that the response {о а stimulus 
complex is not the sum of all responses to its 
individual components, but an organized pattern 
in which each part depends upon the organization 


of the whole (p. 601). 

Koffka does, however, allow set or atti- 
tude much influence in producing commu- 
nication between "process and trace." As 
an example of this he shows how, under 
the influence of proper set, a previously 
experienced figure is seen as part of the 
stimulus complex of a more complicated 
figure. Perhaps a gestalt psychologist might 
argue that, since the 55 had to make dis- 
criminations on the basis of all cues in the 
complex, this produced an attitude which 
enabled them to discern and utilize the re- 
lationship between stimulus complex and 
stimulus part. 

In summary, the results of the experi- 
ments dealing with learning frequency pro- 
vide very convincing evidence of a direct 
relationship between learning frequency 
and perceptual response frequency. More- 
over, these results provide a set of condi- 
tions under which the relationship breaks 
down. In addition, the matching of learning 
and recognition frequencies was useful in 
the investigation of selective attention in 
perceptual learning. It was found in this 
experiment that Ss who maintain errorless 
responding after attaining initial criterion 
levels tend to respond on test trials in a 
manner which indicates selective attention 
to cues during the process of learning per- 
ceptual responses. For these individuals, 
common and discriminative cues among the 
stimuli during learning differ in their avail- 


ability for association with the various re- 
sponses. 

Surely it is clear that a good deal of addi- 
tional evidence must be accumulated before 
the phenomena of selective attention in rec- 
ognition become clarified. Further work 
might also very profitably be aimed at the 
precise specification of the conditions under 
which the relationship between relative fre- 
quency of occurrence during learning and 
relative frequency of perceptual response 
does and does not hold. This would seem to 
have important implications not only for 
an analysis of such ambiguous stimulus 
phenomena as tachistoscopic and projective 
presentations, but also for statistical learn- 
ing theory. Indeed, the results obtained 
here provide at least a partial bridge be- 
tween the two areas. 

The results of the experiments involving 
reward and punishment during learning are 
not as clear or compelling. Surely the indi- 
cations are that shock in association with 
a recognition response during learning 
makes that response less likely to occur than 
another appropriate response for an am- 
biguous cue configuration. The experi- 
menter's response of "Good" apparently has 
no comparable influence in determining re- 
sponses to ambiguous cues. 

An important consideration in studies on 
the effects of reward and punishment upon 
perception is the point at which the rein- 
forcement occurs during the learning trials. 
Proshansky and Murphy (1942) apparently 
presented their rewards and punishments 
approximately simultaneously with their 
various stimuli, although the exact timing 
is not made clear in their article. Moreover, 
the Ss made no overt responses during the 
training period. Schafer and Murphy 
(1943) report that reward and punishment 
came after S had repeated aloud the name 
which the experimenter had presented 
orally just prior to exposure of the face. 
Тре sequence in the case of the experiments 
of Rock and Fleck (1950) and Jackson 
( 1954) was as follows: name told. by ex- 
perimenter, stimulus presented, S repeated 
name, and, finally, reward or punishment 
given. Apparently, Solley and Sommer 


(1957), however, presented the stimulus 
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first, then pronounced the name for the 5, 
and administered reward at some time after 
the face appeared. No specific mention is 
made of whether the 55 did or did not pro- 
nounce the names during the training series, 
but miscellaneous comments in their article 
give the impression that the 5$ did not do 
so. Smith and Hochberg (1954) adminis- 
tered shock simultaneously with the pres- 
entation of the stimuli. For the first 12 
trials, the name was given to each $ prior 
to each presentation, and thereafter 5 called 
out the name of every figure after it was 
shown. 

The administration of reward and/or 
punishment after exposure of the stimulus, 
but prior to the response of the S (whether 
overt or not), does not seem reasonable if 
one is interested in the reinforcement as- 
pects, rather than the cue aspects, of the 
reward and/or punishment. This is most 
obvious in the case Where either reward or 
punishment is associated with one stimulus 
and nothing with the other stimulus of an 
ambiguous team, The finding of Smith and 
Hochberg, for example, of significantly 
more unshocked than shocked responses to 
the ambiguous test stimuli is explainable in 
terms of the absence of the electric shock 
а5 а сие for one of the two possible re- 
Sponses. One further difficulty in many of 
these studies is that the relationship between 
reward and punishment, on the one hand, 
and the conditions of learning, on the other, 
is obfuscated because the correct responses 
Were given either prior to ог immediately 
after exposure of the stimuli, giving the 
Ss no chance for anticipation and subse- 
quent reinforcement or Correction. (There 
is evidence, as one might expect, that the 
5s in these experiments showed very ineffi- 
cient learning, as demonstrated by enormous 
numbers of erroneous test responses, This 
is discussed by Rock and Fleck, 1950). 

But associating reinforcement with re- 
sponse in paired-associates learning, as in 
the present experiments, is objectionable to 

those who argue that such a procedure may 
affect the response but not the perception. 
To illustrate this position, in the case of 
their experimental situation involving shock, 
Smith and Hochberg (1954) distinguish be- 


tween "autism" and "inhibition of the 
verbal ‘naming’ response,” and indicate that 
their procedure produced the autistic effect 
without verbal inhibition. | 

The problem of differentiating what is 
perceptual апа what is purely verbal in а 
perceptual-type experiment has been par- 
ticularly vexing to research workers in this 
area. One attempt at a solution to this 
problem has involved complete operational- 
ism, where perception is defined in terms of 
the experimental conditions and recognition 
as the association of a correct name with 
а given pattern. As Estes (1959) has 
pointed out, the difficulty with this opera- 
tional viewpoint “. . . is that, as always 
when one puts on blinkers, we would be in 
danger of failing entirely to see important 
aspects of the problem" (in press). How- 
ever, investigators like Murphy, Schafer, 
Proshansky, Pastore, Smith, Jackson, Hoch- 
berg, etc. have surely not been guilty of 
moving too far in the direction of opera- 
tionalism. In fact, some of their statements 
seem to indicate that a few have given per- 
ception or the "really see" process a scien- 
tific reality independent of the observations 
Which provide its anchoring. For example, 
in his argument that the experimental tasks 
of Proshansky and Murphy (1942) were 
judgmental and not perceptual in nature, 
Pastore (1949) states: 

In order to subsume the experiment under the 
Category of perception, the authors must demon- 
Strate that the visual experience of the subjects is 


actually changed ; the rey arded lines must be actu- 


ally magnified in the visual experience of the sub- 
Jects (p. 470). ' 


to provide anchoring for 
inferring perceptual and not 
purely verbal processes. Thus, їп support 
of their viewpoint that the Ss “actually 
Perceived” in accord with the training: 
Proshansky and Murphy (1942) point ott 
that their Ss frequently complained of the 
difficulty of perceiving the stimuli, that the 
Ss displayed a good deal of interest in ше 
stimuli, and that the Ss responded carefully 
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and not with undue haste. Similarly, in 
arguing for a genuine reorganization of the 
perceptual field on the part of their children, 
Solley and Sommer (1957) state that the 
Ss reported the reward-associated figure as 
phenomenally nearer and as having a dark 
line around it, producing contour emphasis. 
The fact of the matter is that so little is 
actually known about the perceptual process 
and its indicators that many of the discus- 
sions on the perception vs. naming problem 
are purely in the realm of polemics. Cur- 
rent theories on the factors involved in 
making differential responses to stimulus 
configurations are quite inadequate and 
desperately in need of clarifying data. All 
investigators involved in perceptual-type re- 
search are interested in the responses of Ss 
under certain rather loosely defined condi- 
tions. And, from the viewpoint of emphasis 
received in the literature, surely the naming 
or identifying response is most important. 
With the development and maturation of a 
theory, perhaps other types of responses 
will be equally well investigated for differ- 
entiation among higher-order constructs, 
but the groundwork must be firmly laid in 
terms of the more basic data. The experi- 
ment of Neisser (1954) is illustrative of 
the process of gradual elaboration of a 
relatively narrow perceptual concept upon 
the base of a well-established core. After 
making a rough distinction between what 
an S sees and what he says, he attempts to 
show that familiarity with words facilitates 
"seeing rather than saying the words" in 
tachistoscopic presentations. Defining his 
concepts in a very restricted fashion, he 
demonstrated that prior experience with 
words lowered the tachistoscopic thresholds 
for these words but not their homonyms. 
The present research project has been 
concerned broadly with those factors in the 
process of learning differential responses 
to stimulus configurations which affect 
naming responses to ambiguous cue sets. 
In accord with most of the research in the 
area of perceptual processes, the test re- 
sponse variable studied was naming or iden- 
tifying. Thus, returning to the point of 
departure for this particular discussion, the 
question of whether the response or the 


perception is affected by reinforcement ap- 
plied during acquisition of the naming re- 
sponses is not particularly meaningful at 
this rather crude stage of perceptual theory. 

One surely gets the over-all impression 
from these results în toto of the greater 
significance of frequency of occurrence than 
of differential reward and punishment in 
influencing responses to ambiguous cues. 
The most extreme ratio of one test response 
to the other possible response was 21 to 5 
(see Table 19) in the case of the experi- 
ments on reward and punishment, but that 
order of ratio was obtained repeatedly in 
the frequency experiments when one learn- 
ing stimulus appeared four times as often 
as its testmate (see Tables 3, 7, and 17). 
Beyond that there are the precise predic- 
tions possible with learning frequency as 
the independent variable and the corre- 
spondence of results with the theorems of 
a well-formulated theory, advantages which 
do indeed seem to be a long way off in the 
case of reward and punishment. 


SUMMARY 


The over-all aim of this research was to 
investigate the effects of variations in the 
conditions of learning upon responses to am- 
biguous stimuli. The learning consisted of 
trial blocks of figure-name pairs learned 
to a criterion by the method of paired asso- 
ciates. Test trials, in which the Ss were 
presented with cues common to previously 
learned figure groups, followed the learn- 
ing trials. The Ss were instructed to re- 
spond to the ambiguous cues with the one 
of the previously learned names which 
seemed most appropriate. 

The learning variable of interest in the 
first set of experiments was relative fre- 
quency of occurrence. Eight groups of Ss 
were run in five experiments. For seven of 
these groups the learning figures were com- 
posed of arbitrarily chosen cues and con- 
sequently meaningless. The learning figures 
for these groups contained common as well 
as differentiating stimuli, and the test stimuli 
consisted of only the common cues. For the 
remaining group the learning figures con- 
sisted of line profiles of human faces with 


до 


the lines continued to form semicircles. The 
profiles were constructed in pairs having 
common contours, and the test stimuli con- 
sisted of these common contours in a full 
circle. 

Among the groups whose learning figures 
were completely meaningless, two groups 
were each shown a total of eight figures per 
block. For one group the figures occurred 
at equal frequencies per block, while for the 
other group the frequencies per block for 
the various figures were (2:1) : (4:1) : 
(3:1:1:1) (where the parentheses show the 
frequencies of figures having common cues 
on a test trial). Three other groups in the 
category of those shown meaningless learn- 
ing figures were shown four figures, at fre- 
quencies, respectively, of (2:1) : (4:1), 
(3:1:1:1), and (3:2:2:1). 

The responses to the test stimuli were 
either consistent with the test cues, in the 
sense that the response names were of learn- 
ing stimuli which contained these cues, or 
the responses were names of learning stim- 
uli having cues which conflicted with the 
test cues. A hypothesis stemming from a 
Statistical model of visual recognition was 
supported by the finding that the relative 
frequencies among the responses consistent 
with the ambiguous testing cues tended to 
equal their relative frequencies of occur- 
тепсе during learning. However, there 
Were two test stimuli for which this match- 
ing did not hold: the (2:1) test for the 

(21) : (41) : (3:1:1:1) group and the 
test for the (3:1:1:1) group. Hypotheses 
for these discrepancies were offered and 
deductions from them confirmed. Further, 
а second major hypothesis that the learning 
of discriminatory responses results in the 
ability to identify subsets of the learned 
cues was generally confirmed. Recency as 
an explanation for the frequency findings 
was ruled out. 

The remaining two of the seven groups 
shown nonsense learning figures were used 
to test the hypothesis that Ss learn to ig- 
nore selectively the nondiscriminative cues 
among the stimuli in a multiple discrimina- 
tion situation involving recognition re- 
sponses. Use was made of the finding that 
55 tend to give recognition responses to 
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test cues at the same relative frequencies аѕ 
the occurrence of these cues as components 
of various previously seen learning figures. 
The test analyses included the naming Te- 
sponses of one group of Ss to a stimulus 
containing the cues common to all learning 
figures, as well as the naming responses of 
another group to stimuli containing both 
discriminative and common cues. Both 
groups were shown the same four learning 
figures equally frequently per block for the 
first three blocks, and then at respective 
frequencies of (2:1:4:1) per block through- 
out the remaining learning trials. 

Only those Ss who made no errors in the 
learning trials subsequent to the initial cri- 
terion level were used in the test of the 
hypothesis of the selective use of cues. 
Among these Ss, those who were shown the 
test stimuli containing both common and 
discriminative cues responded with the 
names of figures at relative frequencies not 
significantly different from the relative fre- 
quencies of occurrence of these figure- 
name pairs during the learning trials of 
differential frequency of presentation. How- 
ever, those Ss who were shown the test 
stimuli containing only common cues re- 
sponded with names at relative frequencies 
which differed significantly from those ex- 
pected if differential learning frequencies 
were matched, but did not differ signifi- 
cantly from the frequencies expected if the 
Common cues were selectively ignored БУ 
the fourth learning block, 

The group which had the profiles as 
learning figures were shown four 0 
them at frequencies of (2:1) : (4:1). The 
matching between relative frequencies © 
occurrence during learning and relative 
frequencies among test responses held #07 
their first response but not for their Se 
ond; lack of independence resulting fr 


the particular nature of the stimuli W? 
hypothesized. 


The learning variables of interest in m 


second set of experiments were reward 
punishment. Four groups of Ss were ma 
and four learning figures were show! ! 
each group. The learning figures were Р 
pairs in the sense that reward or puis 
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ment was associated with one member of 
a given pair and no reward or punishment 
with the other. Reward or punishment fol- 
lowed response. Two test stimuli, one for 
the cues common to the members of each 
pair, were shown to the Ss. 

One group received “Good” as reward, 
while the other three groups received elec- 
tric shock as punishment. These latter 
three groups differed principally in average 


intensity of shock. No significant differ- 
ence was found between the number of re- 
sponses which were previously rewarded by 
“Good” and the number not so rewarded. 
However, a tendency was found for Ss to 
give test responses which were not previ- 
ously shocked, at least in their first re- 
sponses. The difference was significant for 
one group (medium shock intensity) and 
in the given direction for the other two. 
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OR too many years we psychologists who 
Eee interested in verbal learning have 
been using scales of meaningfulness of 
. questionable validity. The most frequently 
referred to study of the association value of 
А nonsense syllables has been that by Glaze 

(1928). Especially since only its results 
| 


have been reported in а more readily avail- 
able secondary source ( Hilgard's chapter in 
Stevens, 1951), the shortcomings of the 
"original study are usually unknown. The 
Glaze study had but 15 subjects (Ss); 
many nonword, three-letter combinations 
. were omitted from the listing; 29 items had 
two association values (which also raises a 
question of the precision of estimation of 
_ the single-valued items); and the manner 
| of S's responding may have placed some 
differential constraints on reporting associ- 
ations since 5 had to verbalize them to the 
experimenter (E). Another and slightly 
more recent study has been used somewhat 
less (Krueger, 1934). This study had some 
advantages over Glaze’s, but it also had 
some inadequacies. Although a total of 
586 Ss served in it, not all Ss judged all 
syllables; instead, each association value 
was based on 200 Ss. And although the Ss 
wrote the syllables and their responses to 
cach, they also had in view up to 49 of their 
Previous responses. The considerably higher 
values reported by Krueger might have been 
due 10 the recognition of associations among 
the items in view and to the fact that Ss 
were allowed considerably more time to 
respond than in either the Glaze or the 
present study. Furthermore, the first 100 
E MEM " 

1 The writer is indebted to Nancy L. Westby, 
who gave considerable assistance in the preparation 
of the materials and in the running of the subjects, 
and to the Numerical Analysis Laboratory of the 
University of Wisconsin, without whose facilities 
this study would have been impossible. 
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responses were considered practice, and 
apparently the responses to these were dis- 
carded. As will be evident from the present 
study, the most serious fault was the failure 
to take into account the wide differences 
among individuals. If the final values do 
not represent the averages of all Ss, the 
association values for individual items will 
not be comparable. 

An additional reason for the present re- 
evaluation is that, since the above studies 
were made, ап indeterminate number of 
so-called nonsense syllables have become 
acceptable words, slang expressions, and 
abbreviations. 

This study is a re-evaluation of the mean- 
ingfulness of all possible three-letter com- 
binations of the Roman alphabet of the 
form consonant-vowel-consonant, with the 
restrictions that the two consonants are 
different and neither is ау when у is the 
vowel. Because it seems incongruous to 
perpetuate the self-created paradox of talk- 
ing about the meaningfulness of nonsense 
syllables, it is herewith proposed to call 
these combinations "trigrams," and in view 
of their form to further specify them as 
“CVC trigrams.” 

Since the determined meaningfulness 
might be a function of the verbal fluency of 
the Ss and furthermore the two sexes might 
respond differently to a particular trigram, 
the fluency of the Ss was measured and 
cognizance taken of the sex of the re- 
spondents. 


PROCEDURE 
Materials. There are 2.480 possil 
binations of three letters of 
alphabet which meet the res 
es above. These trigrams were arranged 
m eight groups of 300 ; | Г 
ght groups of 300 and one group of 80. 


Ме com- 
the Roman 
trictions men- 


Р 
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The orders were partially determined by 
Glaze's values. In order to prevent the 
establishment of artificial anchoring points, 
two restrictions were imposed on the 
Sequence of presentation of the items: (a) 
Where possible, successive items were of 
different association values and (5) succes- 
Sive items had no letters in common. Each 
of the nine sets of trigrams was made into 
а 16-mm. film strip with a blank space be- 
- tween successive items, 


Method of presentation. The film strips 
Were presented to groups of Ss which varied 
in size from 12 to 24. The order of presen- 
tation of the film strips was determined by 
the rows of two different 9 X 9 latin 
Squares, i.e., there were 18 sequences of 
film strips. Therefore, although the order 
of presentation of items was fixed within a 
film strip, the order of film strips was varied 
to prevent the establishment of artificial 
anchoring points. To obtain a check on re- 
liability of judgment the first film strip was 
repeated after all others had been presented. 
Therefore $ saw 2,780 or 2,560 items, de- 
pending upon the length of his first film 
strip. 

The film Strips were presented at a rate 
of 2 sec. per frame, except for the first 15 
items in the first Session. Since there was a 
blank space between successive trigrams (to 


judgments), the 
esponding was 4 
in the first session 


rame in order to assure that all Ss could 
learn the required responding routine, 
To аа: „бы keepi 


signaled. The Scoring sheet had 30 15-item 
columns. In the blank space immediately 
preceding the first trigram a large “1” was 
printed in ink directly on 
the fifteenth 
“16” 


ground, while the trigrams were white on 
a grey background, If, in spite of the special 
numbering, 5 lost his place, his instructions 


were to “get in step” at the start of the 
next block of 15. After the complete show- 
ing of the film strip a specific Section was 
repeated if needed. This was seldom wed 
sary. The signaling system seemed а E 
quate, and the Ss proved quite adept 
keeping their places. і 
da ‘IBM fess with 300 Y-N -—— 
spaces was issued to each S before the sg 
of each film strip. The sheet bore the num 
ber of the film strip he was about to see. 
Each 5 recorded his name, date, age, M 
sex on the sheet before the film strip was | 


Wisconsin students from several шо 
tory psychology courses served in the 5 ird 
For various reasons the data for 16 53 H 
discarded. The final, otherwise unselec ^ 
sample had 185 women and 134 men. fil 
cause the sequence of presentation of uta- 
strips proved significant, the final сотр 
tions are based on 18 matched groups 9 
Ss each (N = 216). In this final mue 
the mean ages of the men and women W 

s im 


shown. | E 
Subjects. A total of 335 University 


20.8 and 19.2 years, respectively. à 

Task. In order to maximize the 5 
terest and motivation, the author (who Ss 
was a course instructor for many of the or- 
Save а 15-min. explanation of the ШТ t 
tance of the study, emphasizing why 8 re 
саге should be taken in making each re- 
Sponse. Before the first film strip was P 0 
sented, the Ss were told that they bow. 
pronounce each trigram {о themselves 7 5 
ask themselves the following questions: 
it a word? Does it sound like a word? 

^ 


it remind me of a word? Can I use it 1? 


й af 
Were instructed to 18, 


n i 
the scoring sheet in the “Yes” اوی‎ 
they answered “Yes” to any of the d" to 
tions. Tf they could not answer “Yes 


any of the questions. they were to m@ 
mark in th 


€ u ie 
1 е “No” column on the s€ 
+ Sheet, iyen 
jve | 
After each film strip the Ss were £i 


ое 
а 
k 
sentence ?” They f 


ке? 
ino 


a »-min. rest. Before each subsequent, of 
strip the author again reminded е 7^. 
the criteria for Scoring an item as to 
ingful. In addition, they were ит the 
maintain their standards througho™ , оп 
Study. The Ss saw 900 ог 980 triga” 


A] 
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each of three consecutive days: Sixteen of 
the groups were run in the evening; two 
were run in the afternoon. 

Fluency test.. After the showing of a film 
strip on the second day, the Ss took a short 
verbal fluency test. The Ss were instructed 
to write all of the words in the English 
language they could think of which began 
with the letter "s" and which were not 
proper nouns. They were allowed 5 min. at 
this task. Any questions about the task were 
deferred until the end of the experiment. 

General conditions. The study was con- 
ducted in a classroom; and, to minimize 
interaction among Ss, they were encouraged 
to spread out. The trigrams were projected 
onto a screen at the front of the room. 
For the evening sessions, only the overhead 
lights in the rear half of the room were 
used. This provided enough light to mark 
the scoring sheets, but the screen was dark 
enough to make the trigrams legible. The 
projected trigrams were about 5 in. high. 

Either the author or Westby was in the 
room at all times during the showing of a 
film strip. The Es deliberately showed 
interest in the Ss' work so as to main- 
tain motivation and interest. After the 
first 1,000 or so trigrams, judging of mean- 
ingfulness got to be a less than exciting 
task. If, however, any S showed evidence 
of carelessness, persistently communicated 
with a neighbor, or was unable to keep his 
place on the recording sheets, his data were 
discarded. There were 16 such cases in all. 
In a few cases $ would accidentally skip a 
column in his recording. The standing in- 
structions were to continue the task and 
record the last column on the reverse side 
of the scoring sheet. The record was then 
transcribed correctly by the Es. 


RESULTS AND Discussion 


Fluency. The correlation between num- , 
ber of words written on the fluency test and 
number of trigrams recorded as being 
meaningful (as defined previously) was 
computed for men and women separately. 
The obtained values were .128 (for 134 
men) and .138 (for 185 women). Neither 
correlation is significantly greater than .000. 


If, however, the correlation is computed 
without regard to sex of the respondent, 
the value is 0.142 (N = 319),.05 > p > .01. 
The higher correlation is due to the in- 
creased range of fluency scores. The overall 
range of fluency scores was from 19 to 94 
(о = 13.63, М = 53.01). 

Incidentally, there was a significant dif- 
ference between the two sexes on the flu- 
ency test. Women wrote 55.4 words in 5 
min., while men wrote only 49.7 (1 = 3.672, 
317 df, p < .001). 

Meaningfulness. Before determining the 
percentage of Ss reporting a trigram as 
meaningful, it was necessary to check on 
the effect of two potential variables: 
sequence of viewing and practice. Even 
though the film strips were made so as to 
minimize any anchoring effects, there was 
the possibility that the sequence of viewing 
the film strips could lead to differing levels 
of judgment of meaningfulness. In addi- 
tion there was the possibility that the cri- 
terion of meaningfulness used by the S 
might change as a function of practice in 
the study. 

In order to test the effects of sequence 
and practice, the data from 18 groups (cor- 
responding to sequences of film strips) of 
12 Ss each were assembled. Although flu- 
ency was not highly correlated with the 
reporting of a trigram as meaningful, the 
groups were assembled so as to have nearly 
equal levels of fluency; and although it was 
not possible to have an equal number of 
each sex in each group, the total of 216 
data sets were for an equal number of men 
and women. 

The range of average number of "Yes" 
responses for the 18 sequences was from 980 
to 1,678. In an analysis of variance the 
differences among the 18 sequences were 
significant (F — 3.664, 17 and 198 df, p 
< .01). Two conclusions are possible : (a) 
the groups were unequal in their disposition 
to judge the trigrams as meaningful and/or 
(b) the particular sequences of viewing 
altered this disposition to judge the item as 
meaningful. 
| In either case it seemed imprudent to 
ignore the effect of sequence and use a dis- 
proportionate number of Ss from any one 
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Sequence. Therefore all sequences were 
given equal weight, and only the data for 
the 55 appearing in the just described split- 
plot analysis of variance were used in the 
final determination of association values. 
Regrettably this meant using the data for 
only 216 Ss rather than the 319 cases avail- 
able. Though this may appear to be grossly 
inefficient, it seemed desirable in order to 
assure a reasonably good unbiased estimate 
of association value. The loss in precision 
Was not very great: assuming an association 
value of 50%, the Tg 15 3.4 for n = 216, 
whereas it would be 2.8 for № = 319. Even 
the former is a considerable improvement 
over Glaze (oe = 12.9, for М = 15). 
Practice in making the required judg- 
ments also proved to have a significant effect 
upon the likelihood of regarding a trigram 
аз meaningful. The mean percentages of 
trigrams judged meaningful as а function 
of the order of viewing film strips were: 
54.1, 53.6, 53.5, 52.8, 53.6, 54.4, 56.1, 57.0, 
and 56.8, from first to last, respectively. 
Though the change was not great, it was 
significant (F = 5.150, 8 and 112 ар, р < 
001). This result suggests that S’s criteria 
shifted with practice ; and since the terminal 
values are higher than the initial ones, it 
may also be that 5 became more imagina- 
tive and willing to judge an item as mean- 
ingful, 
Because of the matching operations per- 
formed in constructing the sequence groups, 
It would be meaningless to test for а sex 
difference in association value for the 
matching group data, Probably a better 


test is 19° compare the initial, unselected 
sample of 319 cases. Women were slightly 
more inclined to judge a trigram as mean- 
ingful than were men, 1,402.7 уз. Ыс 
respectively. The difference was insigni- 
cant, however (¢ = 1.024, 317 df, p > 05). 

To illustrate the nature of the individual 
differences observed in the present study, 
the frequency distribution of Ss e. 
various numbers of trigrams as meaning? 
(out of a possible 2,480) is shown in Fig 
ure 1. For simplicity the data were groupe 
in intervals of 50. 


FREQUENCY _ _ 
menoon 


2223 
56789101 12131415 16 17 16 19 202! p 
NUMBER OF TRIGRAMS JUDGED MEANINGFUL» 
aber 


Fic. 1. А frequency distribution of the nurple 
of trigrams judged meaningful (maximum poss 08 
was 2,480) for the 216 Ss (108 men am 
women) whose data appear in the final tabula 
(Since there was no overall difference betwee? 
sexes, a single distribution is shown.) 


tions: 


The main results of the present study “fe 
shown in Table 1. The trigrams are ai 
ranged alphabetically within each level at 
association value found ш the ae 
study. All of these values are based 0n *' 
responses of 108 men and 108 women- 


TABLE 1 
ASSOCIATION VALUES or TRIGRAMS 

Trigram Archer Glaze Krueger | Trigram Archer Glaze "T || Trigram Archer Glaze кешеге 
ХҮН 1 0 19 ХҮ] 2 0 2 XEJ 3 0 16 
XAZ 2 13 39 хүр £ б Р т : С 3 

х XOV 

n А Т 3 GYQ 3 0 3 XUF 3 13 2 
an - 7 A QUJ 3 0 2 XU] з 13 в 
т 2 ОУ 3 0 18 XUQ 3 7 29 

3 0 36 | XEF з о 23 | xuy з 0 

а Significant. sex difference, 


y 
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TABLE 1—Continued 


RI 


Trigram Archer Glaze Krueger Trigram Archer Glaze Krueger Trigram Archer Glaze Krueger 
XYF 3 0 2 XUD 6 7 51 ZIY 8 13 42 
XYV 3 0 30 XUG 6 20 42 ZUF 8 7 54 
Үп 3 13 24 XUS 6 20 50 ZUY 8 7 36 
ZOJ 3 о 41 XYK 6 0 34 
ТУТ 3 » 19 XYM 6 30 BYW 9 — 52 
: XYN 6 4 40 СЕ] 9 20 40 
КҮ] 4 7 38 XYR 6 32 GEX 9 0 38 
QOJ 4 7 29 ZEJ 6 0 42 KYF 9 7 46 
ХІҮ 4 7 19 ZIH* 6 7 55 KYG 9 7 56 
ХОҮ 4 27 47 ZYX 6 20 56 PYV 9 , 63 
XUH 4 0 24 QUV 9 27 52 
XUV 4 0 26 BY] 7 13 54 QUX 9 33 43 
XYW 4 7 30 СҮ]* 7 20 42 QYB 9 13 50 
YEJ 4 27 54 РУ] 7 40 51 ТЕ] 9 40 52 
ҮЕО 4 33 35 СУ] 7 37 УП 9 40 43 
ZIJ 4 27 46 HYJ 7 13 61 VUJ 9 27 49 
KU] 7 33 48 XEC 9 33 40 
NE ? с Б | kN 7 0 51 {хр 9 B 30 
ne ; AES LYJ* 7 27 49 XOC 9 20 45 
OU 5 0 40 NIJ 7 20 55 XOG 9 27 39 
буу 5 7 59 NY] 7 7 58 XOL 9 27 47 
УУ] $ 0 24 QYF 7 0 43 XON 9 13 55 
XAH 5 7 2 QYW 7 20 39 XOT 9 27 74 
ХА] $ 7 42 SYJ 7 13 4 XUN 9 53 51 
XEG 5 13 3 | VYE 2 © @ || SHE 9 0 45 
XEH 50 2 ^ 3 @ | oe га № 
S z 7 41 7 20 4 УА] 9 93 
XIQ 5 7 3 7 7 36 YAV 9 7 49 
x» E D 7 20 40 YIX 9 7 34 
XUK 5 0 35 7 20 B YOJ 9 20 23 
XUW 5 0 32 7 27 45 YUF 9 0 49 
NUT Е 34 7 7 51 YUJ 9 7 37 
XYC 5 2 32 7 37 ZYQ 9 0 42 
В 5 73 7 7 28 ZYS 9 0 70 
XYS 5 42 7 23 
xv P з ом юа ә 
Я 10 
20] 5 в 3 7 0 31 GYC 5 on 2 
GYX 6 27 52 HIJ 10 74 
JYK 6 13 54 СІЈ 8 0 44 KYV 10 0 42 
ОЕ] 6 33 24 ТҮС 8 0 42 LUJ 10 47 59 
OIH 6 0 40 КІЈ 8 51 MUJ 10 33 78 
QYG 6 20 32 QEF 8 13 33 РУВ 10 7 33 
ОУН 6 38 QIY 8 13 41 QAZ 10 — 53 
VUQ 6 0 25 QOH 8 20 57 ОЕХ 10 13 55 
WYJ 6 27 62 TYJ 8 33 44 QIW 10 7 44 
XEQ 6 0 42 WUJ 8 7 39 RYW 10 47 58 
XEV 6 27 28 WUQ 8 0 44 TUJ 10 33 62 
XEY 6 13 35 XAB 8 7 45 VOF 10 27 30 
XIB 6 33 36 XAF 8 20 31 VUF 10 7 30 
NIK 6 33 55 ХАМ" 8 13 57 XIN 10 27 50 
XIZ 6 27 30 ХЕК 8 40 XIC 0 40 р 
ХОЕ 6 20 29 XUB 8 7 а ХАО 10 7 31 
хоо 6 20 53 | NUT 8 20 55 | XOM 0 з а 
xos 6 58 ZAJ 8 7 51 хос 10 0 31 


> Blank space indicates trigram was not evaluated. 


€ Item has two values. 


" Significant change in meaning. 
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TABLE 1— Continued 
Trigram Archer Glaze Krueger Trigram Archer Glaze Krueger Trigram Archer Glaze Krueger 
үс ю ов з ж | ух 
ҮОЕ 1 о 2 || GI пл & lao ми а p 
700 10 54 ZEH 14 13 
27- d GOO 13 0 54 | zux 4 2 а 
aw п 2 50 |o b 7 ps ZYR 4 13 5 
п 4 7 60 я р К 
FEQ п zx 4 | E 8B e 2 mw 5 3 0 
ЕО] 11 20 62 KYB 13 20 57 zd 15 X E 
GO n — а [MO в з жю fa m m y 
СЕО 11 40 41 МУР 13 13 47 - JJ 15 53 
GIX 11 7 47 MYW 13 2 20 EE Е 15 20 4 
СОЈ п 2 з | NUQ 13 2 то 15 e 
E HBH 55 | PYW B "EE 5 n g 
n 4 NE] z 
ск iu 7 @ || Gee nh o2 у | Bey 5 B 5 
JYD 11 38 QUW 13 7 = от. 15 2 
KUQ 11 13 63 TIJ 13 40 QOK 15 27 68 
5 1 7 TE B n ait 5 4 4 
/ п 2 5 R 5 
We оп ® 6 WOJ зв 0 а |voo Bos 58 
ХЕМ п 0 6 XED 13 7 4 vYO 15 о 4 
ют п 7 а | XEP B S 5 | ver 15 20 6 
XIG u g (| 13 40 5 | XEN 15 33 © 
XIM п 2 д |xu 5 54 | viz 5 27 8& 
ZYL 1 3 5 [үз Б 0 9 | 259 в о 95 
YEP 11 27 P one 13 27 30 20% 15 13 E 
YiQ 11 13 3 VE 13 27 s ZYK 15 3 46 
ZOF m 7 R ELS 3B 13 Ж 
ZYC 11 13 56 one = 3 BOJ 16 27 m 
; CO 
рар ве 4 sz i 7 " Dij - g 
12 DYB 
G fa gR ОЗ $22 
1 43 
со к | 8|, aw ж о g 
KYW 2 1 % [тум 14 2 45 | KE] m 5 
NOJ 2 13 63 FYM 14 62 NY б p 
ov 2 7 е 1EO м 535 7 | QEB Ü в X 
1 12 J 7 4 | OEC 48 
QIF 12 п (ЛҮ M 27 %4 16 
QUH B c B LA, м о 7 SD 65 4 © 
sy >> og & |. и » © | vaw 16 » ZU 
в з 4 | OA 33 | уон 39 
TYW p вз 5 QAV 14 4) 6 16 13 8 
7, OIG* VUH 7 3 
VAJ 12 40 53 б 14 47 55 үү 16 0 
i R 7 ól 
VEF 02 27 5 | 90У 4 9 3 6 4 2 
006 WUC 20 4 
VUB 12 60 43 г 14 0 38 16 2 
ү SIJ XAV 1 7 7 
УУВ 12 7 53 : 14 0 53 6 9 
5 VUP YIB i6 1» 3 
WYB 12 20 s 14 з 44 6 
XER 12 6 V 14 7 6 2 16 29 
CIR 55 XAN YUQ 16 7 
mB Baise E 
X 12 13 ХАЛ 60 57 38 
хыр г p 5 ХАГ и о 4 FUN E 4 7 
ХҮР 12 41 A 14 e 6 17 3 72 
200 12 XIS M 49 GAC 7 5 10 
> 39 | үрүү 14 Ў E pE 17 7 3 
YOQ 14 YW 17 7 
DEJ з 20 36 YUB 14 Е 41 ТОО 17 Ж. 
4 KIF 17 — 
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TABLE 1—Continued 
Trigram Archer Glaze Krueger Trigram Archer Glaze Krueger Trigram Archer Glaze Krueger 
LIJ 17 27 З JYW 19 7 51 GUH 22 20 57 
LYG 17 40 78 KIW 19 33 48 HOJ 22 58 
MEJ 17 13 63 KUY 19 13 58 KEH 22 20 62 
MIV 17 13 51 LIW 19 20 69 KYQ 22 27 49 
PEJ 17 33 49 NYW 19 27 63 MEF 22 0 44 
QAG 17 53 РҮ] 19 59 MIW 22 40 56 
QAS 17 7 56 QOL 19 56 NIY 22 40 59 
QEK 17 4 66 005 19 7 50 NYH! 22 33 74 
QEZ 17 40 75 QOX 19 20 43 PUV 22 40 61 
ООВ 17 0 43 002 19 53 71 ОАВ 22 40 60 
QYX 17 55 ОПЕ?" 19 60 ОАХ 22 27 78 
RYX 17 13 57 QUK 19 20 54 QET 22 47 72 
TOJ 17 40 5 SOJ 19 33 75 QOW 22 7 49 
VEO 17 13 57 VOB 19 27 48 RIH 22 27 57 
VUG 17 73 64 VUK 19 0 44 RYJ 22 13 74 
VYZ 17 20 54 VUW 19 27 52 RYQ 22 20 58 
WUX 7 e 58 WYQ 19 33 75 SUV 22 — Й 
ХАК 17 40 4 ХАС 19 33 — | TUV 22 13 68 
XAY 17 20 59 XOR 19 27 5 TYB 22 53 66 
XEB 17 7 2 XOW 9 47 46 VAB 22 47 46 
YOB 7 47 62 УАЕ 9 4  — | VAF 22 0 57 
ZEC 17 20 51 YEX 19 з 60 VEJ 22 13 57 
ZIQ 17 0 34 ZEV 19 20 56 VUD 22 27 47 
ZYL 19 13 69 WIH 22 47 59 
BYV 18 13 62 WOY 22 33 65 
БУС 18 7 FIQ 20 47 76 XAP 22 20 45 
FYD 18 33 78 JIC 20 0 51 YOV 22 0 39 
GYH 18 20 33 NYZ 20 53 48 ZAF 22 27 48 
НАЈ" 18 40 64 РУМ 20 47 74 ZEF 22 20 65 
HYF 18 40 ОЕР 20 20 72 ZUW 22 20 27 
JUF 18 20 43 QOF 20 40 66 ZYN 22 73 
JYR 8 33 74 VIH 20 20 69 
KYZ 18 — 80 WEF 20 20 67 СІН“ 23 40 38 
МУВ 18 20 4 WEQ 20 20 53 CUG* 23 40 61 
MY] 18 7 56 ZIF 20 0 60 CYH 23 49 
МАЈ 18 53 56 ZUH 20 }7 48 GYZ 23 20 52 
QOG 18 60 50 HIF 23 20 61 
QYS* 18 27  —* | він" 21 47 47 JEX 23 0 58 
RUJ 18 93 78 вом" 21 47 67 KEQ 23 64 
RYV 18 40 80 HYB 21 47 71 KIY 23 27 56 
TIW 8 27 65 JEV 21 4 56 LOJ 23 27 M 
VUX 18 7 54 HK 21 13 45 MYZ 23 13 77 
VUY 18 27 60 PIT 21 47 73 OEG 23 27 54 
VYD 18 87 59 QEM 21 7 52 QON 23 20 54 
XAC 18 4 QOC 21 0 46 QUL* 23 27 73 
XIV 18 0 42 ООР 1 27 43 QYC 23 73 79 
YOZ 18 7 51 VIF 21 100 а QYM 23 42 
WUB 21 0 54 TYG 23 72 
CEQ* 19 33 51 XEL 21 53 73 VYX 23 7 76 
DYW 19 13 68 YIL 21 0 57 WA 23 20 57 
FYK 19 20 — || 2х 21 20 60 WYX 23 20 63 
FYW 19 20 .5 ZYT 21 0 65 YAB 23 13 50 
GAX 19 0 5 ZAX 23 13 60 
GIC 19 3 44 CAQ 22 4 59 ZIK 23 7 61 
GUX 19 27 43 DAJ 22 33 44 ZOB 23 13 43 
HIW 19 51 FIH 22 40 70 
HYG 19 67 77 FIW 22 27 64 ВЕ] 24 20 57 
JOF 19 33 72 GIH 22 20 41 BIQ 24 40 76 
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TABLE 1—Continued 


Trigram Archer Glaze Krueger || Trigram Archer Glaze Krueger || Trigram Archer Glaze Krueger 
BUH 24 27 66 YAD 25 47 43 ZOL or 2 P 
BYG 2 2 64 YOD 25 з 63 ZUK 27 0 6 
CUH 24 в 68 | ZAH 2 40 39 
GUW 24 20 53 | ZAQ 25 27 69 | CEF 28 0 6 
HYW 24 20 Я | zas 25 13 55 DYH 28 27 41 
JOM а з 65 ZEG 25 7 38 DYT 28 47 2 
dv 24 e 39 | ZUB 5 7 60 | LEQ 28 2:0) 6 
КУ %4 з 73 | ZUL 25 53 59 MIH? з а 70 
мв 21 4 е | ZUN 25 7 NAF 2 62 
ML 2 ZUN 2 40 7 NAF 28 33 а 
PD а 20 зз PUJ 28 4 B 
ov 2 0 3 |во % s м lou вз y @ 
Q 24 67 71 GIY 20 20 63 бүк“ 2 40 88 
QUB 24 7 %8 HYV 5 47 6 ТАП 2 mw 7 
QUT 2 3 3$ | МАЕ 2% 2 5 | vOS в 4o 58 
RUQ 4 B Y |MX % Z 7 | vlc в 20 4 
yuo 2 4 2 |М О 26 2 7 |zaL 28 2 6 
т a a g |01 3 3 7 |у № 4 f 
ZAV 24 33 p E 26 27 63 ZIB 28 20 5 
= а & NYE 26 2 78 28 7 @ 
йм м м 2 = B 23 27 © 
7 70 
ву к. 25 87 85 2 зз 7 
BYP 25 5 2 26 27 74 29 3 D 
CEH 23 3 20 20 я 2 2 6 
5. B д 29 2 
an Lor 2 33 а ; 1n 
1 5 60 в e 7 29 83 
CYZ 25 47 7 71 2 47 
DAX 25 26 27 59 51 
: з 0 48 7 29 7 
FIY 25 7 26 7 41 57 
: 5 67 8$ 2 29 40 
FOQ 2) з м o 0 F 2 40 7 
FUP > e 26 20 64 74 
A у 20 38 р 2 - 2 29 33 
YV 25 7 74 Е 26 27 59 63 
о 5 m з |% 9 7 e 2 у 8 
JEH 5 в а 20Ү 26  — 44 A 69 
JYV 25 0 ZUG 26 47 MP 29 20 77 
KAQ" NE 2 29 33 в 
KEG à 3 5 | Bav 27 а g 2 53 @ 
KOO 25 33 62 РЕК 27 60 64 29 -— 60 
KW ж p [юу 2 s Я 2 4 ж» 
МУЕ 25 7 37 СЕН 27 55 29 33 85 
NUY ж, 5 | GS 7 з а 2 53 67 
NYO 5 1 @ | ۷ 27 з Á 2 20 6 
Е 25 MEQ 27 t 29 27 
on 3 $ 2 ME  % 8 ж nz $ 
QO 2 4 g | MAX 2 в 
QUN 25 e V QOM 7 13 | 0 4 7 
QD z y 9 [ООР 5» 2 95 n» E 
QYZ 25 33 68 RU x 27 33 > 30 33 |, 
RIJ 25 > 70 OQ 27 53 68 30 13 67 
RUW 25 20 29 УАН 27 40 30 9 
TOV 05 te VEK 27 ч з 
y 0 62 ; 4 6 30 62 
ГУН 25 27 d VIW 27 47 чы 30 27 65 
VUC 5 3 i | You a у 2 30 20 
WOO 23 в q | YS 2 y 9 30 20 7 
we в B og х UG 2 B E 30 20 79 
х 25 Z 27 2 8 30 7 mJ 
XYZ 5 4 © || Yup 27 s 0 з е 
79 ZEY" 2 0 39 
РЕ, 7 м g 30 40 (б 
30 40 
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TABLE 1—Continued 


Trigram Archer Glaze Krueger Trigram Archer Glaze Krueger Trigram Archer Glaze Krueger 
RUY 30 20 56 Сүү" 32 47 64 КЕС 34 33 59 
SY А 30 33 75 DIW 32 20 72 LIH 34 27 71 
УЕ2 0 60 63 FIJ о 80 7 MIQ* 34 53 74 
VUM 30 20 66 FIK 32 40 65 MUH 34 33 70 
WUH 30 0 5 GEK 32 27 40 MUP 4 40 75 
YAX 30 з 62 HUV 32 80 79 NIQ я 27 79 
YIK 30 32 JW 32 B 73 QAR 4 47 89 
ZAB 30 33 53 KIQ 32 53 82 QOT 4 53 82 
ZAW 30 13 79 KOH 32 40 68 TAH 3.20 62 
KUC 32 8 6 VIY 34 60 
BEX 3 — 3 LYH 32 33 67 VUZ 34 20 68 
BOF 3l 7 52 MEH 32 40 63 WOX 34 65 
BYH 31 33 6 NAQ 32 27 64 YEM 34 0 6 
CYQ 31 27 49 NOQ 32 47 %6 YIN 4 з к 
СҮХ" п 27 63 QEW 32 7 67 YUW 34 20 6 
DUY 3» 53 75 QEY 32. 20 70 YUZ 34 0 73 
FUH 31 7 %6 QYL 32 27 7 ZER 344 20 62 
FUY пи з ZA RIY 23 27 7 ZUP 34 7 82 
FYS 31 53 78 RUH 32 47 86 205 4 20 5 
САК 31 27 6 TAZ 3 67 60 
GEV пи 53 75 TEH 3 з 70 BAJ 35 з 73 
GIZ 31 4 6 TUH 3 20 75 BYS 35 53 73 
HYN 31 62 ТУЕ 32 20 79 BYX 35 40 59 
KAH пи з 70 VYL 32 73 6 CIQ 35 в 07 
KEF 31 20 5 YAG 32 27 B COH 35 27 mg 
LUH 3. 40 7 ZIC 32 20 58 DEQ 35 40 77 
MEV 31 27 39 GAH 35 0 62 
MIY 3. 33 78 САХ" з 40 73 JID 35 о 64 
МУН" 31 40 68 DYF 3 47 78 JUB 3 67 7 
NYM пи 6 70 FEG 3 47 69 JUW э 4 8I 
PIW 31 7 23 САЈ з 33 83 KUX 35 33 72 
РОВ 1 47 55 НУХ 3 27 6 MOQ" 5з 40 о 
PYO 31 75 LYB 33 53 86 NUC 35 27 
QEN 1 27 55 MEB 33 40 72 NUK 35 20 7 
QUR 3 40 6 NYF 3 4 67 PEH 35 27 50 
SAJ" п 20 7 NYG 3 60 72 PIB 5 33 R 
SYH 3 27 66 PYZ з 2 64 QIX 35 з д 
ТА] 3. 4 70 | QOD з 13 75 | QOY 3 13 я 
TEF 1 20 52 Svo з 20 6 QUM 35 47 
31 47 58 TIH 3 40 69 QYN 38 5 
пи 73 70 TUQ з 33 76 SIH 35 20 78 
п — 7 TUY з 4 78 SUQ! 35 7 87 
9 27 69 TYS з 40 69 SUW? 5 6 75 
1 33 53 ҮҮР з 27 6 ТЕВ 35 5з 68 
31 8 86 WIB з 35 S TEY 5 з 8 
31 0 50 WYK з 53 75 тү 35 20 51 
31 27 6з | WYS" 3 53 76 VEC $ 0 6 
31 33 70 ZEM з 533 а YIS 5 6 79 
п 27 7 ZIS з 13 76 YIT з у "7 
31 5з 5%4 ZYG 33 0 57 YUN! 35 20 6 
31 13 64 ZEX 35 0 63 
31 33 64 BYM 34 78 ZOG 35 13 65 
31 20 66 CUV 34 73 79 ` Y | 
DIH 34 33 73 BUP ә 
BUJ 2 о g DIY 34 60 79 CAZ о EA br 
CER 32 2 75 | FUQ и 2 а | соо % rx 5 
Br B Xd € o M 27 в | eux 36 в я 
CY 32 з 60 107 34 33 7 DYQ 36 Jo я 
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TABLE l1— Continued 


Trigram 


Archer Glaze Krueger Trigram Archer Glaze Krueger Trigram Archer Glaze Krueger 
GED 35 13 ай PAZ 38 60 7 FUB 40 13 50 
HOQ 36 87 REZ 38 33 77 GOK 40 20 72 
JAF 36 73 RIQ 38 з = | GUB 40 27 69 
JEC 36 13 6 RYS? 8 53 ğ GYR 40 60 84 
ЛА 36 27 68 SIY 38 60 79 JUK* 40 60 76 
МЕР 35 47 а SOZ* 38 4 6 | JYL'" 40 4 
1 36 47 61 SYV 38 82 KYC 40 40 
nye $ 3 n | EZ 38 4 65 | Loz A m5 m 

2 TUW 38 27 75 MOF 40 40 
QUZ’ — 36 20 77 | тух 38 з 8 | MYD 4 — 80 
ВЕН 35 40 73 ҮҮС 38 0 8 PAF 4) 13 41 
ROJ 3 27 73 VYN! 38 40 83 PEX 40 40 12 
RUZ 3 zm а WEC 33 60 а QAN 40 6 7 
SE 36 з 73 | WEM з з б RYL 40 60 82 
ТЕО 36 82 WEX* 38 53 86 TEV 40 D № 
wey © з е | wig з 47 ж | TIY 4) 60 86 
ne : 4 75 WOG 38 7 67 ТО2* 0 40 9 
ЕУ - а я |WUM № s y | vis 0 5 g 
0 %0 ХАМ 38 — 8 sg VUN' 40 3 m 
ge 3 s y | 38 40 55 WYZ 40 53 4 
7 38 о 5% | vus 20 85 
24 7 p 5 Yos 38 4 73 и 
GYB P AE MEC s B 6 ee а 9 g 
GYB 9 AD 38 47 72 | BYR 1 з 8 
нп у @ Ж ж s» y 2 || BYE n 5 7 
Doy fla ..-|9 sz 

3 27 a à 4 6 
ms Zg 63 CIZ 3 33 66 | СЕР а зы 6 
КҮР СҮМ 3 6 75 | GOH г 4 
о 7 з а DOY 3 8&0 7 HEZ ü o» д 
MIJ $ g 8 | as 9 67 ав | nc à wv n 
uo 9 ш g |®© 5 з € |Huw 4 H5 69 
T 37 33 7 a 61 
aw oy 60 5 Hur 2 ww 70 jie а а 6 

VY а : 7 7 | КА д 2 Л 
ТЕС 37 2 JEK 39 40 Е 3 
ye ам вом а ы s 

37 з 9 E 3 74 NIS 60 
ZAK KAZ 39 2 1 @ 
es ум Fixe № з € | 209 4 3 7 

U A Ir $ 2 КО a d 8 
ў NEZ v з јот а Я е 
B P3 9 л |х » 9 | 4 6) 66 
бү Oo z a QUT з F4 a UY a » © 
CUW 3 д REJ 39 E 4 4 8 
61 TUP 70 TIQ 87 

CYG 3 : 1 53 
Po A „ м X 4 Bv d M og 
БҮў* 54 Тү? 67 WEH 4 47 76 

V: 38 67 39 13 7 1 

ns 9 » B we 9 2 & BEW 64 

FUW B 47 5 39 67 È pe 47 
WA 71 FEX 81 
ы a g тр воо ggg 
wa a g sm валі gs% 
MUQ 38 33 77 39 7 60 i 
GEZ 2 а 7 
NS > i 79 | CAH 40 Gow a qq 5 
хон 38 s 0 | CEK 40 «m 2 4 в 
3 68 CI 40 66 GUV“ 86 
X 40 60 42 87 

64 LUQ 42 53 
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TABLE 1— Continued 
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Trigram Archer Glaze Krueger Trigram Archer Glaze Krueger Trigram Archer Glaze Krueger 
MIP 42 33 58 HAQ* 45 47 86 CIF* 48 67 69 
NEQ 42 40 81 HEG 45 20 81 CUK 48 60 78 
PUH* 42 40 72 JOH 45 60 80 СУБ 48 33 89 
PUO 42 40 83 JOX 45 69 DUQ 48 67 85 
QID 42 20 78 MOG 45 40 84 DYR 48 73 86 
RYZ* 42 40 85 MOY 45 73 GYL 48 60 89 
VAZ 42 33 84 MUY 45 40 70 HE] 48 27 76 
VOK 42 53 73 MYG 45 67 66 HIV* 48 83 
МЕ! 42 40 80 РЕМ 45 33 68 KAC* 48 73 67 
WIY* 42 27 76 POV 45 53 73 KIZ 48 20 78 
QIC 45 73 83 MUW 48 13 7 
FOH* 43 33 56 ТЕР" 45 87 86 MYC* 48 33 72 
FUJ 43 40 85 WOB 45 53 71 MYK“ 48 82 
FYZ 43 60 81 WYG 45 27 83 NOY 48 47 79 
GOY 43 77 WYM 45 60 78 PAJ 48 53 65 
HIY 43 60 79 YUH 45 27 75 POY 48 53 79 
PUW 43 33 72 QAD 48 40 — 
QEL 43 60 78 CYK 46 33 76 QAL 48 73 79 
RIX 43 13 63 DYS 46 67 76 REW 48 73 74 
SYB 43 82 HUZ 46 4 92 SEB 48 13 51 
VAY 43 53 75 KEV 46 20 46 VEP 48 20 65 
WIV 43 73 81 KYS* 46 60 88 VUL 48 60 93 
YAN 43 40 64 LIG 46 3 89 WYF 48 40 88 
ZAN 43 40 65 LIX 44 — 90 А 
ZON 43 33 74 LIY* 46 73 87 BIV 49 40 60 
Lys 46 53 92 BYG 49 60 83 
BEH 44 20 63 NEP 46 73 86 KYL* 49 40 93 
BEZ 44 60 77 PEV 46 66 LEB 49 20 62 
DEH 44 27 71 тон 46 33 74 MUZ 49 73 87 
роо 44 33 87 TYD 46 60 83 NEF 49 47 72 
DUT 44 80 86 VEM 46 67 81 NUS 49 60 77 
DYX 44 33 82 VOX 46 60 81 QIP* 49 60 76 
DYZ 44 60 88 YAT 46 47 79 500 49 47 86 
ЕАР 44 53 60 ZEP 46 53 91 WYL'" 49 — 88 
HIB 44 73 77 
ТАН 44 75 BYK 47 67 73 BEM 50 60 65 
JOR" 44 80 80 CIB 47 27 64 CYR 50 53 81 
ТҮВ 44 27 65 FUX* 47 33 87 DIB* 50 40 7 
KIB 44 53 79 GOM 47 67 87 DUH* 50 27 60 
NID 44 33 60 GUC 47 13 46 FAH 50 47 70 
NUJ’ 44 27 80 HIQ 47 88 FOW 50 80 85 
PEQ 44 47 78 JYF 47 40 76 GUK 50 7 50 
PIV 44 53 75 LYX 47 73 91 HYL 50 60 84 
QAM 44 20 69 MYX* 4 53 86 115 50 47 Ж 
015 44 _ | QAC 47 73 90 JYN 50 87 95 
КАЈ 44 87 90 012 47 87 КЕЅ 50 60 83 
RUK 44 4 76 RAX 47 67 90 KYN* 55 60 8 
КҮС" 44 67 63 RUC 47 33 65 KYX 50 _ 78 
КУЕ 44 67 80 RYG 47 53 86 LAH* 50 47 77 
RYK 44 27 70 SIQ 47 53 89 LOH 50 67 80 
ЕО 44 67 86 SUG 47 87 80 LUF 55 67 94 
рыт 44 27 87 ТОЕ 47 53 89 МАВ 50 80 81 
kr 44 60 78 тур" 47 47 68 MOX 50 20 76 
N 44 40 66 TUL 47 87 82 NUZ’ 50 53 68 
ҮП, 2 ГУ 47 47 89 РО7^ 50 47 81 
YOC 4 27 62 VYM 
WUP 47 13 72 PYR 50 80 87 
a 5 67 71 YAQ"? 47 52 OIK 50 67 91 
РЕ 4 60 72 ZIN 47 13 69 QIN 50 33 75 
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TABLE 1—Continued 


Trigrams Archer Glaze Kreuger Trigrams Archer Glaze Kreuger Trigrams Archer Glaze Kreuger 
ROH 5 6 в || JYG 53 47 7 BUX 56 8 93 
ЗАН 30 60 81 LER з 53 86 CIM 56 n " 
* 5 53 87 || LUP 53 84 СҮР» 56 5 7 
SYX 5 67 8 LYV* з 60 8 DEV 56 80 92 
VID 50 40 57 MEK 53 67 88 FYB 56 60 92 
VOD 0 47 6 | МЕ з 60 88 | JAV 56 87 — 9l 
WOH 530 4 83 NIM з 67 74 KOV* 56 60 7 | 
РОЕ 5. 7 58 CYA 5 27 7 
BAQ* 51 83 POG 5 = 9 LYM! 36 67 89 
BYC 51 73. 8 | PYT 3 73 9 РАО 55 27 88 
СЕР 51 76 | PYX 53 47 8 РЕВ 55 87 89 
рон 51 75 VIT 533 67 88 РУС 56 90 
НАХ 53 4 89 WUL' 53 8&0 7 WEG 55 40 86 
JOP 1 6 в | ZIM з 4 64 | WYP 5 53 8 
NAZ — 51 вю s | ZIT з 33 7 "llis 
suc H3 9» @ > ” | ny 7 67 8 
51 5 ee Ne ^ 78 
SUX SI а 8 CAG 54 73 8 | CES a 2 n 
ТАО 3 € @ FYL* 54 67 87 DYP 3 з — 
VIZ 51 2 БУХ 54 60 в ЕАУ 57 з 9 
YOM 51 i 6 GAW 54 27 6 НҮС* 57 72 
YUC оз | KEW 53 в я HYR 57 4 8 
KOS 54 95 JEZ 5 20 82 
BYR 2 47 ву | RUN 54 80 LYG? 57 80 8. 
DAK 2 4 7 LUT 54 67 89 MOH 57 40 8 
FAW ә 67 ж | Мм я во в | М 57 9 g 
FID 52 73 NUP 54 73 73 NAL 57 40 73 
FOT 2 в g | PED 54 33 6 | PIF 7 73 9 
GEY 52 47 g | BUZ 54 оз 79 | PIM 57 60 7 
GIF 2 € 5 | АК 54 3 96 TYC 57 — 9 
HEP — i$ 45; go | КАО" я 73 9 WOV 5 73 8 
HEF 2 p фу RYN 3 80 8 2АС 7 27 7 
me 2 з 6 [|$ О BM оз в А 
Jv. 2 27 gq | МЕМ 34 10 7 | BEP 58 13 6 
KR 9 в BOS я x a |в X В g 
SL g $ | YE 54 =з о | рул FEE в 
ГЕК E Я 7 EN 34 з 9 TER 588 47 | 
LEZ 52 FAZ 55 КЕТ. 8 " 
MZ в д М | № BÀ с @ |а к к 9 
МІЕ 52 HOZ = 73 76 7 
NOH 2 4 0 | нут s 9 8 zd 2 p 5 
N Е pA 8 5 
UM 2 2 g JIT 55 67 7 МАУ 2 S 74 
МҮК 2 5 go | JOD з з M | Mz а 
ot ga sa R S пум ао а 
SZ g 7 9" МУ 5 б я | № в 4 8 
2 52 60 84 MYS: 55 Xv ә 
7 Е 3 55 93 94 РУК 5 83 
SYG 52 60 NAS E 4 58 53 
ТА 2 68 25 80 91 OIL 5 0 89 
Y 32 60 go PIO 55 i RY 58 10 Я 
VOS 52 x : > 3 89 RYB 58 67 8 
A 67 76 SUH 55 ЗЕ 87 
ZAR 2 67 ж | SYK 5 2 79 | ee 58 вю PB» 
ZEK" 2 73 a TUS 4 s 9] ave 58 Я an 
EE 23 84 IE 58 [ d 
вон з & gg Ses 55 10) о TAS. s m: 8 
СУТ, 53 8 we 5 вю 8 WEK" 58 0 А 
DEZ з ө бу [т % B я | ҮР 5 40 f 
DUW* 53 Д 55 1 
FEY? 5 W 7 | ZB 55 ‚Я ВАЕ 9 z 8 
з 93 в 7 38. 74 g 
ZUM 55 у į BOC 50 8 2 
74 BYL: 59 = 85 
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Trigrams Archer Glaze Kreuger || Trigrams Archer Glaze Kreuger || Trigrams Archer Glaze Kreuger 
CEW 59 60 79 РАС 62 40 89 РҮШ" 64 87 94 
CEY 59 40 78 FIS 62 80 87 RIK 64 53 83 
CIX” 59 60 67 FOM 62 80 75 RIS 64 87 85 
DOF 59 73 77 FUG 62 67 89 ЗАВ 64 87 90 
DUV* 59 67 86 FUT 62 93 86 SIF* 64 40 90 
РУС 59 92 JUR* 62 87 92 TIX 64 73 80 
FAX 59 53 91 KUF 62 53 87 TOX 64 87 93 
РУС 59 67 84 ТУК“ 62 60 97 VIL 64 93 88 
САБ" 59 27 78 мок" 62 67 87 WAH 64 47 83 
HIX® 59 40 92 MYT 62 67 95 WAQ 64 60 79 
КОХ" 59 80 82 NAD 62 60 70 WOF 64 73 76 
LIR 59 60 92 РАН 62 40 85 YOK 64 93 85 
MOZ 59 84 TEM* 62 73 89 
NYT* 50 80 93 TUR 62 93 87 FOL 65 93 95 
SYC 59 73 80 VOM 62 53 86 FON 65 93 92 
ТАМ" 59 40 76 YAL 62 67 85 HET 65 80 92 
TIB 59 73 78 YUR* 62 53 86 HUD 65 87 93 
TIG 59 100 82 ТЕБ?” 65 33 59 
ВЕЕ 63 73 82 КАЕ 65 93 87 
CYT 60 80 92 BIC 63 87 83 KEL 65 87 88 
DOX 60 73 85 BOD 63 80 93 KIR 65 73 78 
FYT 60 73 90 BOZ 63 53 77 KOC 65 47 73 
GAZ 60 94 CEM 63 60 81 KYD* 65 73 93 
НҮК" 60 60 86 FYR 63 93 96 LEP 65 87 96 
KOX 60 85 GID в — 7 LOD | 6 80 94 
LATA 60 73 91 GOS 63 80 88 LOM 65 73 81 
NEH 60 27 83 GUF 63 27 79 LOX 65 80 87 
NER 60 73 93 GUP 63 67 70 MEY 65 91 
PYS 60 78 HIN 63 93 87 NAR 65 100 84 
SUL 60 87 95 HOV" 63 87 90 NYC 65 89 
VAP 60 53 89 JOL 63 93 93 PAB 65 60 60 
WEP 60 53 80 70Ү 63 67 93 РҮС 65 80 92 
WOC 60 59 LEG з B 79 РҮҮ" 656 60 93 
ZAM 60 53 71 109 63 87 93 SYP 65 100 90 
LUZ" 63 60 77 VIR 65 93 80 
ВАХ 61 67 90 ЕҮРр*" 63 80 95 VIX 65 40 93 
BIX 61 27 58 LYZ 63 60 87 VOG 65 40 85 
DYG 61 73 82 N 63 67 73 WID 65 80 90 
GAN 61 80 80 NY? 63 40 86 WOK 65 67 86 
GEW 61 40 73 2 63 67 86 WUS* 65 47 80 
GUZ 61 80 88 © 63 93 93 
HYS 61 60 88 SIZ 63 73 95 BEK* 66 47 94 
JAL 61 53 85 TYR 63 73 92 FIV 66 93 92 
LYR 61 60 94 VEN 63 67 93 GES 66 87 80 
NEX 61 73 87 WAZ 63 47 77 ТАБ 66 73 85 
РОМ 61 73 89 WOL 63 87 90 LUN 66 93 93 
RUP 61 67 84 YAS 63 60 99 IYD 66 93 92 
RYT 61 80 91 ZOM 63 13 67 NUG 66 93 91 
SAQ 61 60 89 PIZ 66 50 
TYE 61 40 92 CEX 64 13 69 OIT 66 87 96 
VIG 61 27 84 DAH" 64 33 74 RIZ* 66 73 85 
YAR 61 93 —° GOF 64 67 83 RYD 66 53 92 
ZEB 61 53 73 ПАВ 64 67 94 TOB 66 73 87 
KAV‘ 64 73 80 VOC 66 27 78 
ВАР 62 67 72 КЕМ 64 47 80 
BIK 62 67 90 KYT 64 80 91 САК" 67 87 89 
pyp“ 62 94 LIE 64 100 98 CIK* 67 33 84 
CEZ 62 47 81 NOM 64 93 90 DAP 67 80 89 
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TABLE 1—Continued 


Trigrams Archer Glaze Kreuger 


Trigrams Archer Glaze Kreuger Trigrams Archer Glaze Kreuger 
DAW 67 80 89 МЕС 69 87 90 CUN: 73 60 78 
DUR 9 87 92 | М0 - 69 93 оз DUS 73 93 Ж 
N 7 94 | RIL 6 80 83 GYN 73 87 
FUM: 67 80 90 RIV 69 87 95 JUN 73 80 97 
HUK 6 6 81 SAF 69 8&7 86 | кор" 73 80 
KAS 67 8 83 | SOF 69 100 96 LUM’ з 80 #4 
LEF* 67 8 95 | SYL 69 87 96 NAM 3 8 8 
m. 67 7 85 SYR 60 73 96 REK з 87 9 
; g 10 9 |} TOK ою я с | RUD 73 10 98 
MUN б 4 о | Wir 60 8 8 | SIV 3 73 87 
NUR 7 з в | WUR бо 6 g YOW 3 20 85 
NY" 6 з 0 | WOT 6 20 70 
ми бош 5 o n € o» 
0 DYM 70 73 95 COS 74 93 9 
ROR, 7 з 8 | HEV 70 73 в | Coz я оз 96 
TN @ B 9 || HUS 70 73 97 | DAF 74 53 96 
um 7 ROM m 70 8 в | DAL и юы B 
WIK 67 А 7 53 79 DUP 74 53 
wom 2 E 20 LOF 70 87 98 DUX 4 60 8 
wor ¢ № RYP 70 80 97 | FEN 74 73 9 
Үк б 4 E: SUT 70 00 ә | Hor 5 86 
7 HOF 4 87 
BIM * в | TAY 7 60 л HOX* 7 73 9 
CUL VON 70 47 79 KO 7 91 
в в $ 7 т, 74 80 
p. а | we A 87 8 May 74 s 8 
DIR B 7 84 MUI 86 
FEY 2 a 59. | 2АР 7 з а | мк » 3 а 
FIC p 85 
FOS Б A 91 ВІР" л 20 5 BE a 80 91 
BUN в у % | CER л ® в | sim %4 dg 74 
GUR 68 — 73 Е кы п 6 8 | TEK 74 40 88 
JA 8 6 g РАЗ Л 0 9% | түм м б 9 
8 — 80 gg т VIK 74 100 95 
ТУР HES 71 27 " 
ш g gaje a м я ш 5 
PES 68 100 95 xd 71 10 %5 BEY 75 
$m goa ite IEE п sm 
100 90 CEN 5 91 
E с 75 100 
м вос по 2 Y à 
VAM в № и | вы э | RES m e ж 
vov @ 89 ы |в 2 $9 s |29 ж w g 
VOY 6 73 Wu E n X 4 75 
woz в 2 м (HD 7 в | нк 5 9 f 
S EBENE MEM 
8 8 a 97 
CETE to ий | м 72 4 m ko 5 9 39 
pve б о о Куу 72 m я | KOR 97 
DYK" 6 g N 72 89 5 n 5 93 
FES 69 100 E: NOP 72 60 Fi LIB 75 93 91 
FOD 69 REL 72 7 MOS 75 87 95 
HOD  & ® 8 [| TD 2 F3 95 | NOK 75 = 9% 
HUC ө % a | vor 2 ву оу | PEL E g 
ТҮМ 6 в 4 "Ee в а و‎ | oe S 
1 6 VAG 75 73 9 
KOM 69 80 
а 89 CED* VIS 75 100 9 
LUB 69 47 gg COV g 60 вз 
3 
3% | Brp 76 в 8 
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Trigrams Archer Glaze Kreuger Trigrams Archer Glaze Kreuger Trigrams Archer Glaze Kreuger 
DAZ* 76 96 KIP 79 80 91 RAF 81 80 96 
FAK 76 93 94 NOC 79 87 = SEM? 81 100 90 
JUS 76 10 97 NUL 79 93 89 SIM 8 6 95 
KER 76 53 89 PAX! 79 8 83 TOS: 8 93 95 
KOG 76 40 74 REY* 79 80 95 VAC 81 100 97 
ГАК 5 93 в | RIF 7 8&7 9 || WOR 8 8 9 
MAH 76 73 93 SOG 79 73 86 WOT 81 94 
MER 76 87 91 | SUK 7 0 95 | WUN 81 60 9% 
MUR % 93 92 | TER 79 10 95 | ZIG я 6 82 
NUB 76 13 73 VEG 79 87 84 ZOW 81 7 83 
PAV 76 93 86 
RAB 76 93 93 BEC* 80 10 92 CIR 8 03 8 
RES 76 87 94 CIS 80 97 COL 82 100 97 
ROP 76 10 96 | DER so sz 91 | РЕК 82 93 90 
RUL* 76 93 83 GAT* 80 92 DIZ 82 80 97 
TEW* 76 27 80 GIB 80 87 90 DOP 82 93 97 
TIF* 76 80 89 GIM 80 67 96 FAM 82 100 99 
VAS 76 | 93 96 JAS 80 93 96 GON 2 99 9 
YAW 76 JOK 80 93 96 GUL 82 100 95 
KAW 80 53 86 HAK 82 67 93 
BUK 77 80 89 KOB 80 53 83 JER 82 93 93 
CAV 77 80 96 KON 80 53 86 JIB 82 60 84 
FAC 77 87 92 KUD 80 53 82 LOK 82 80 97 
БЕТ 77 60 94 LEX 80 90 MUC 82 80 91 
GAV 77 67 87 MIK 80 80 95 PUD 82 93 95 
KUS 77 60 94 MOD 80 100 96 ROK 82 73 94 
LEV 77 93 94 NEV 80 87 95 ROM 82 100 95 
RYM 77 93 93 КАР“ 80 100 95 VIN 82 100 93 
WIF 77 73 93 ВАУ“ 80 100 96 VOL 82 93 97 
YEH 77 80 85 ROV 80 100 94 
ROZ* 80 67 89 BER* 83 100 92 
BIZ 78 100 95 TES 80 100 9% BOL 83 10 96 
COK* 78 73 93 TIS 80 47 88 BOT* 83 87 91 
DEP 78 100 95 TUK 80 87 95 DAG 83 67 90 
DEX в B 8 TYP* & 93 9 DIV 8 93 95 
FOK в 80 83 YEW 80 93 DUF 8 4 9 
GOL" 7 87 93 FEB 8 з 81 
KAL 78 33 93 BIR 81 87 91 FOB* 83 
KOW 78 60 96 COF 81 93 98 GAR 83 99 
KOY 78 33 89 FOY 81 33 77 GOR 83 80 93 
KUZ* 78 86 FUC* 81 94 HAR 83 73 93 
LIM 78 100 98 GER 81 87 92 КАВ* 83 47 87 
MIR 73 93 9% | GIR я 87 87 || KUM m в B 
MOV 78 87 93 HAF 81 73 89 LAT 83 80 93 
93 90 LIS 83 100 95 
NOL 78 47 83 HOS 81 
POS 78 93 94 HYP 81 87 99 LOR 83 80 93 
4 73 96 NEY 83 53 91 
REN* 78 80 86 JAK 81 v 
TEC' 78 80 86 JUL 81 — 93 NOG 83 67 83 
YON 78 80 91 KUR 81 67 93 POC 83 92 
YOR 78 47 91 LEM 81 80 87 РОМ 83 87 94 
ЖОК" 78 20 74 LEY 81 67 91 PUC 83 95 
MIC* 81 53 95 РОК 83 47 83 
D 100 97 MUS 81 93 99 RAS* 83 93 95 
Bir, 79 93 97 МЕК" ы 93 96 SYD" 83 60 96 
FUK 79 87 NIB 81 80 VAR 8 03 85 
HOY 79 88 NIG* 81 93 97 WAF 83 73 90 
JAX 79 47 87 NOX 81 67 81 WAM 83 67 90 
1 79 93 95 NOZ* 81 80 94 WIZ* 83 87 94 


JES 
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TABLE 1—Continued 


Trigrams Archer Glaze Kreuger Trigrams Archer Glaze Kreuger Trigrams Archer Glaze Kreuger 
WYN 83 98 BUF 87 73 91 KIC 89 60 91 
CUF 87 93 97 КОР" 89 67 98 
ВАМ 8 60 9 | GIP 8 60 оз | LIC 8 93 97 
DIS м 100 97 | HOL 87 10 9 | Lon 80 10 9% 
DIT 8 80 85 | KUP 8 6 90 | MES 89 60 97 
FER 8 80 93 | LYN*. 87 8&7 оз | MUF 8 93 97 
HIR 8 з 9 МАК 87 100 9 МАН 89 13 8 
HUL 8 80 98 | MOL 87 9 95 | NEC 80 9з %8 
нок — 8 10 99 | NIK 8 6 9 | PEC 80 73 96 
KED'" 8 4 9 | ROS 8 93 95 | POL 89. 93 99 
MOC 8 8 95 | SER 87 100 96 SUD 8 93 96 
NOS 84 93 % | SEZ 87 6 95 TUF 80 87 93 
NOV 84 100 88 SYN* 87 100 97 WAV" 89 87 94 
РЕК 8 8&7 97 || Tor 87 во 98 | үр 89 89 
РОК 8 93 95 | TUG 87 80 98 
EM и әз 1 lvow  g BUL 90 100 9 
т & 10 9*9 | мк я ue в 3 95 
SIK 84 80 96 | Б o o» EZ 
TIK 84 80 0 | à 
Toc ы з м | в s o (DAT яв n 
WAL — 8 07 FRE 88 _ DIK 90 s 9 
WER Mon g К 8 8 % DOV 90 10 98 
WES  & DAR — 88 10 95 | DUC o) 10 96 
80 я DEF = 
m. з 9 о | FEM 90 93 [d 
BES 85 2 . % | FIZ 90 1 
GEN 83 a = BS 8 87 97 FUS 90 o 97 
КИ 5 в 0 | в , 9 GOP 9 s 9 
OMM в а |) tee & и || НАС о 5 £ 
КМ в ж d [ha 8 8 о | HEX 9 s 8 
мк в J 88 91 HOB 100 87 
pom 85 60 ә | Las 88 20 
RAK ы ® Pr ва " [м я m 9" 
AUR 5 87 9 Xm JUT 9 
TAL в 10 в | EUR $ 8 % Кїч 9 5 
85 \ 3 9 А 9 
мс в S E 8 о бф De g з 10 
WM 85 g RUS 88 10 % [р E 
91 SEC 88 93 оз PRR 20 92 
CIG 86 SEN 88 80 „ M 5 
DAV в dw % Ee B @ у Ara A 100 
uU адаа g м 2 ком 
SYP в ^ | в am w FL 0 WM 8 
HYM 86 E a . 88 53 77 SUC 90 73 
JOC % з æ | у в Ww 20 04 
Кт в m y YUL ве 89 ә VEN ш 80 
86 60 53 
LUC 86 К ZAG 88 6 > ВАС Ий а on 
ee a о 10 | ВАГ, 9] 100 ‚ 
МОТ 86 80 s BAK 89 " BUR 91 100 9 
REB ю о | CIN 89 100 o6 | CAW 91 " 
uv po" o» bw LO m» ш и 
{ 86 9 e 89 = 
SED в ж db у ® 9 | DUL 9) 10 % 
SOR — 86 y | сю к * AW о ж 9 
WIP в gw Z |у 8 100 g | HOM or 100 9 
wUz m $$ | HED вю о | HOR ك‎ 8 n 
YAC 8 پو‎ d | Jac ю g @ | HUF n *€ - 
yur! в 2 6 | JED ю Z 9 ШИ 9 67 2 
2 | JN : 72 | LIK 91 10 9 
89 100 9 
99 | МАЗ 91 93 ? 
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Trigrams Archer Glaze Kreuger || Trigrams Archer Glaze Kreuger || Trigrams Archer Glaze Kreuger 
NEL 91 80 95 BAH 94 94 SIC 95 93 93 
PIK 91 80 98 BEV 94 100 90 VAT 95 
REG 91 10 94 BIL 94 93 93 YAP 95 40 89 
RIC 91 93 88 COX 94 
ROL 91 67 98 DEM 94 73 98 BOG 96 
ROX 91 73 95 DIC 94 10 91 BON 96 97 
SUR 91 100 98 DIL 94 80 94 CAL 96 
YAM 91 DOL 94 100 DEC 96 100 94 
FAG 94 Б DUB 96 
2 98 FUZ 94 93 DUM 96 99 
ОМ E 94 HAZ 94 100 95 FIR 96 
COM 92 97 HEW 94 HUN 96 
CUM 92 93 94 HIL 94 100 99 BET 96 99 
DIX 92 97 JAB 94 _ LUX 96 99 
DOR 92 87 98 KAN 94 87 96 MUT 96 
FUL 92 80 97 KAP 94 96 NAB 96 
HAP 92 99 KID 94 NAT 96 99 
KAR 92 53 95 LOV 94 100 100 SOY 96 80 87 
KIL 92 93 99 | MAL 94 93 98 | SUP 96 100 
LEW 92 98 MIN 94 93 99 VEL 96 87 92 
LOB 99 99 9 MOR 94 97 
NIC 92 97 NAW 94 47 91 ВАМ 97 93 
NOR 92 PIX 94 67 88 CAD 97 
PAS 92 10 99 PUR 94 93 96 COB 97 
PED* 92 100 92 RAZ 94 87 99 COY 97 
PUF* 92 100 100 REF 94 93 93 DEB 97 87 99 
RIN 92 87 88 REP 94 98 DEL 97 80 97 
SAN* 92 80 98 RIT 94 98 FAB 97 9 92 
SIG 92 87 94 ROC 94 87 97 САР 97 98 
SIL 92 93 95 SAK 94 93 % GIT 97 97 
YEL 92 93 99 SOL 94 93 100 GOB 97 93 97 
SOP M М9 " GUS 97 100 
93 100 97 TUN 9 7 HEY 97 
BOS 93 95 VIM 94 р HON 97 100 98 
CAM 93 WAT 94 10 95 HUG 97 
CIT 93 94 WEL 94 95 HUM 97 
CUS 93 100 98 WIS 94 100 94 JEL 97 100 98 
CUZ 93 3 91 И MAW 97 
FEL 93 100 99 95 MEL 97 67 95 
FIL 93 80 94 95 MIS 97 80 9 
HAV 93 100 97 95 MON 97 100 98 
KIS 93 87 9 95 . 96 NIX 97 
LAF 93 87 10 95 87 94 РАМ 97 80 90 
LIN 93 87 95 95 з 95 PAW 97 
Lv 93 10 98 95 99 | PEW 97 
LOS 93 100 97 9 з 9 POD 97 
MOW 93 95 87 98 PUS 97 100 
NUM 93 67 90 95 SOC 97 100 98 
PAC 93 100 97 95 86 TAC 97 100 98 
PAK оз 8 9 95 47 80 TAD 97 96 
L 93 10 99 95 87 99 ТАХ 97 
d оз 100 95 95 8 9 TIL 97 80 100 
ха Б 93 80 91 95 87 96 TOG 97 73 97 
ВЕЕ 93 10 9% 95 6 90 TOW 97 
REN © 96 95 TUX 97 8 9 
we HH 9 9 3 7 | VIC 97 10 97 
E d оз 27 88 95 93 96 WAS 97 
YA 95 97 WIL 97 100 96 
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TABLE 1—Continued 


Trigrams Archer Glaze Kreuger Trigrams Archer Glaze Kreuger Trigrams Archer Glaze Kreuger 
WON 37 сор 99 РАТ 99 
53 CUB 99 PAY 99 
YEN 97 90 | DAB 99 PEG 99 
Er 97 2 у DAN 99 PEN 99 ds 
1 3 9 | DEW 99 PIC 99 93 
DIM 99 PIG 99 
BOP 98 53 69 | DUZ 9 4 8 PIT 99 
COT 98 FAN 99 POT 99 
DOC 98 100 | FAT 99 PUG 99 96 
DOZ 98 10 94 | FED 99 PUT 99 
FAY 98 FEW 99 RAG 99 
GAB 98 93 10 FIB 99 RAH 99 60 100 | 
GYM 98 10 97 || FIG 99 RAN 99 
ЕМ 98 FIN ФАР 
ШЕМ 2 N 99 RAP 99 
ШЕЙ » FOG 99 RAW 99 
HUB 98 FUR 99 RED 99 
pn e GAL 99 98 RID 99 
JAY x GAP 99 100 || RIG 99 
JIG 98 Ed о RIM 99 
JOT 98 ROD 99 Е 
сот 99 R 85 
M B. pm s ow» ^ 
rep Е. GUY 99 и © 
LES 98  g оу | HAG 99 RUT 99 
LUG 98 HAL 99 100 SAT 99 
NAY 98 НЕР 99 87 | SAY 99 
NIL 9 € y | HE e % | SET 99 
NI : 
NOR z JAG 9 6 9 SIN. » 
PUN o8 91 | JAN 10 99 || Sip 99 
REX 98 JAP 99 SIT 99 
хз „|, |н o2 
SAX > 97 | KEG 99 % im 99 
р 98 87 97 KIN E SUN 99 
X 
SUB № qw $ |LAD 9 m 0 99 
TAM 98 100 101 | LAY fe TEN x 
TOD ә LEG 99 TIC К в | 
TON 98 LID 99 С 99 100 
TUM 98 з gy tow > TOM 99 
99 
MAC 9 TOY 99 
C» Ms g "omm A 
> вю к 9 in п vs" 
BEN 099 MAX — 99 в gs | WAD o% 
MID WAG 99 
BID 99 99 87 99 
BIG 99 MIX 99 WAY 99 
BOW 9 no 4 "s 9 
BOX 99 MOP 99 
BUG 99 MUG 99 BAN 100 
BUS 99 NAG 99 BAR 100 
BUT 09 NED 99 ВАТ 100 
BUY i NEW 99 BAY 100 | 
BUZ 99 93 96 NOW 99 BED 100 
CAR 99 PAN 99 BEG 100 
PAR 99 BET 00 
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TABLE 1—Continued 


Trigrams Archer Glaze Kreuger Trigrams Archer Glaze Kreuger Trigrams Archer Glaze Kreuger 
BIT 100 HIT 100 PUB 100 93 85 
BOY 100 HOG 100 RAM 10 

BUD 100 HOP 100 RAT 100 

BUM 100 HOT 100 RAY 100 

BUN 100 HOW 100 RIB 100 

CAB 100 HUT 10 RIP 100 

CAN 100 JAM 100 ROB 100 

CAP 100 JET 100 ROT 100 

CAT 100 JEW 100 ROY 100 

CON 100 97 JIM 100 RUG 100 

COP 100 JOB 100 RUM 100 

COW 100 JOY 100 SAD 100 

CUP 100 JUG 100 SAG 100 

CUT 100 KAY 100 60 97 SAM 100 

DAM 100 KEN 100 10 99 SAP 100 

DAY 100 KIT 100 SAW 100 

DEN 100 LAB 100 100 SEX 100 

DIG 100 LAG 100 SIR 100 

DIP 100 LAP 100 SOB 100 

DOG 100 LAW 100 SON 100 

DON 100 LET 100 SUM 100 

DOT 100 LIZ 100 80 94 TAB 100 93 
DUG 100 LOG 100 TAG 100 

FAD 100 LOT 100 TAP 100 

FAR 100 MAD 100 TAR 100 

FIT 100 MAP 100 TED 100 

FIX 100 MAT 100 ТЕХ 100 100 97 
FOR 100 MAY 100 TIN 100 

FOX 100 MED 100 93 98 TIP 100 

FUN 100 MEN 100 TOP 100 

GAS 100 MET 100 TUB 100 

GEM 100 MIT 100 VAN 100 

GET 100 MUD 100 VET 100 93 99 
GIN 100 NAP 100 WAR 100 

GUN 100 NET 100 WAX 100 

HAD 100 NOD 100 WEB 100 

HAM 100 NOT 100 WED 100 

HAS 100 NUT 100 WET 100 

HAT 100 PAD 100 WIG 100 

HAY 100 PAL 100 WIN 100 

HER 100 PET 100 WIT 100 

HIM 100 PIN 100 ZIP 100 

HIP 100 POW 100 60 95 


а Significant sex difference. 8 
b Blank space indicates trigram was not evaluated. 


c Item has two values. . 
4 Significant change in meaning. 


In the “Glaze” and "Krueger" columns 
а blank space (Footnote b) indicates that 
Glaze or Krueger did not evaluate the item. 
Both authors deliberately excluded most 
three-letter words. Footnote c in either 
of these columns indicates that two different 
values for the item were reported by each 
author. Glaze had 29 duplications (and 


most of them were not eliminated in Hil- 
gard's chapter), and Krueger had 14 dupli- 
cations. 

In the “Trigram” column Footnote a 
means that there may be a significant differ- 
ence between the sexes for that particular 
trigram. А chi square (corrected for dis- 
continuity) for the 2 X 2 contingency table 
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of sex X meaning was computed for all 
2,480 items. If the chi square exceeded 
3.841 (the value needed at the .05 level for 
1 df), the trigram is annotated with Foot- 
note a. There are 226 such items. Theo- 


retically we might expect 124, 1.е., .05 X 
2,480 chi squares to be significant at the .05 
level by chance. Therefore, it is unlikely 
that there is а true sex difference for all 
226 trigrams listed in Tables 2 and 3. 


TABLE 2 


192 Trickams WHICH APPEARED то ВЕ More MEANINGFUL то WOMEN 


ч Association Value 
Trigram 


Association Value 


Chi Square || Trigram Chi Square 
Overall Women Меп Overall Women Меп 

BAQ 51 59 44 4744 57 65 49 4834 
BEC 80 86 73 4.823 89 94 а 4.866 
ВЕК 66 76 5 9.065 68 moms 9,321 
BEQ ш 50 32 6.192 88 95 81 И 
3 89 78 4, .92 
box 8 GR m D$ f$ p 
BOT 83 90 77 5.633 40 51 30 9.315 
BYD 62 75 49 14331 3 78 4.033 
BYL 59 69 49 7644 83 89 7 6013 
BYQ 24 33 15 9. 48 56 39 5542 
СА Я i 9.144 41 4 2 ты 
Е 67 5% 5 52 е 4.173 
CAX 33 7.521 25 31 19 Ht 

к, 40 26 4112 84 Е 78 58 
CED 73 8 — d 11 91 7 НЕ 

4.570 5 2 5 5. 
СЕО 19 25 13 Е 54 62 45 203 

eid 4.335 7 53 
CES 57 Е 73 81 66 266 
2 49 4.834 c У 4.86 
СЕТ 69 76 61 482 89 04 84 238 
CID 90 96 84 Fo 56 64 49 A892 
CIK 67 %4 60 4112 35 60 я 1170 
CIX 59 67 5 315 78 89 67 pur: 
CIZ 39 4% р 49 5 42 FI 
CYD 75 31 4.383 50 ES 4l 6.00 
CY] ; 7 & 14.402 4 3 18 4.280 
CYN П 3 4.585 2 10.741 
СУ 2 8 е 7402 2 у M 685 
= 0 66 47 7 50 59 41 бе 
em 8 ш aq nd 9 5 8 ЕА 
En o өю я HUE 75 2 68 5.356 
Du a » в бв 9 4 60 H 

64 73 5 ‘ 75 2 
DAZ 76 $^» 6.539 he м 4 6.361 
DI п юю 5 7180 46 5 3 pr 

30 57 USt as 4. 
DUH 95 в 2 4741 a и S8 430 
DUK 80 a 2 13.500 47 32 2201 
DUV 59 2 a 4.866 8 94 82 1260 
DUW 53 el 3 10.144 7 65 50 < 468 
DYK 69 с d 5.368 E 12 3 2681 
DYL 58 7 9 4.241 s 7 54 10107 
Dvv 38 44 6.152 94 79 4 467 
FEN 74 ы 2 3.872 63 70 56 4190 
FOH 43 i £ 6.101 46 54 39 4347 
FUM 67 UNE 4.260 60 68 — 53 1368 
СОН 4l 50 x 7.521 53 61 44 7006 
GOL 78 a 22 6.192 38 47 29 102 
НАО 45 54 = 3.916 28 35 20 4.056 
HIX 59 р 3 6.063 34 41 27 7827 
HOX 74 8l 0 4315 62 71 52 10.739 
НУС 57 a s 4.074 35 46 24 Ho 
HYK 60 m P 5472 76 84 69 11200 
2 4.927 48 60 %6 190° 

31 38 23 
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TABLE 2— Continued 
Association Value . А Association Value 

i i Overall Women Men Ci Sadere: | ESE Overall Women Men Сб sauare 
MYK 48 58 38 8.178 SAN 92 96 88 4.086 
MYL 57 66 49 5.472 SEY 88 93 82 4.233 
MYN 54 62 46 4.774 SOH 50 61 40 8.964 
MYQ 26 32 19 4.074 SOK 88 93 82 4.233 
MYR 58 67 50 5.505 SOM 88 94 81 7.389 
MYS 55 62 47 4.203 SOR 86 92 80 5.424 
NAC 72 80 65 5.192 SOZ 38 49 27 10.399 
NEC 89 94 83 5.672 SUW 35 42 28 4.003 
NEK 81 89 74 6.903 SUZ 68 78 58 8.518 
NOH 52 60 44 4.751 SYD 83 90 76 6.392 
NOS 84 92 77 7.854 SYN 87 93 81 5.736 
NOZ 81 88 73 6.650 SYT 58 66 50 4.861 
NUF 69 77 60 6.203 TAW 59 68 50 6.192 
NUH 38 46 29 6.400 TEM 62 69 54 5.009 
NUR 67 75 58 6.021 ТЕР 45 53 38 4.203 
NYL 67 80 55 14.183 TEW 76 82 69 4.280 
NYT 59 67 51 4.892 TIF 76 82 69 4.280 
PEC 89 95 83 7.007 TIR 67 74 59 4.688 
PED 92 99 85 12.514 TIZ 75 83 68 6.401 
PEZ 51 58 H 4.168 TOQ 27 34 20 4.570 
PIR 74 81 67 4.671 TOS 81 90 72 9.754 
POH 52 59 H 4.172 TOZ 40 49 31 6.975 
POM 85 92 78 7.010 TUC. 87 92 81 3.983 
РОВ 90 94 85 4.099 ТУК 61 75 47 16.383 
POZ 50 59 40 7.408 TYL 68 77 59 6.900 
PUF 92 96 88 4.086 TYM 74 81 67 5.424 
PUH 42 50 33 5.505 TYP 80 89 71 9.408 
РУТ, 64 72 56 5.799 TYZ 39 48 31 6.285 
PYN 65 72 58 4.003 VOZ 27 35 19 5.970 
QOR 25 36 15 11.806 VYC 38 46 29 6.400 
QUL 23 30 17 4.398 WAV 89 94 83 5.672 
QUT 39 49 30 7.759 WEJ 42 50 34 4.861 
QUZ 36 43 29 3.956 WEK 58 66 51 4.286 
QYS 18 25 11 6.133 WEX 38 45 31 4.423 
RAQ 54 61 46 4.190 WEY 72 81 63 7.402 
RAS 83 89 78 4.033 WIM 85 94 76 11.589 
REN 78 84 71 4.527 WIX А 51 36 4.238 
REY 79 89 69 12.181 WIY 42 49 34 4.286 
RIZ 66 75 57 6.704 WIZ 83 89 78 4.033 
ROG 90 94 85 4.099 WOS 54 62 46 4.774 
ROZ 80 86 74 4181 || WUL 53 61 45 4.761 
RUF 86 92 80 5.424 WUS 65 77 54 11.765 
RUL 76 82 69 4.280 WYL 49 57 41 5.354 
RYC 44 53 34 6.799 WYS 33 41 25 5.371 
КҮР 70 78 62 5.634 XAR 25 31 19 4.73 
RYS 38 48 27 9.560 YAQ 47 55 40 4.180 
RYZ 42 50 33 5.505 YUR 62 70 53 6.340 
SAT 31 40 22 7.010 YUV 26 32 19 4.074 


to 
N 
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TABLE 3 


NIN! МЕХ 
34 Тысвамз WHICH APPEARED TO ВЕ More MEANINGFUL TO Mr 


Association Value 
Trigram 


Association Value 


Chi Square 
M ан | médias Overall Women Men 
Overall Women Меп 
, 75 6.021 
7 5737 MEX 67 58 (9 6.130 
ny s" 5 2 445 | MIG 2 B B 6.650 
BIW 1 15 is 25 4.042 NIG 81 73 E. 4.313 
ae P 2 —» 7423 || NUX 30 23 20 6.542 
CUG з 6 30 5174 || QIG 2 à @ 4.168 
DEG 45 33 5 10.778 | ОТР Ei CENE. 724 
83 78 89 4033 | SIF Е 49 4.892 
РОС 55 44 6 9.881 A 2 S 37 ur 
7 5.900 : 
FUS 47 = $ emm | 35 40 5 g 4.671 
GAS 59 45 72 14.982 WAP 74 e a 4909 
САТ 80 73 86 4823 | XAW 8 1 6 5315 
HAJ 18 11 25 6.133 XUJ 3 2 38 4909 
НЕВ 52 43 61 6604 | ҮЕС 31 ; 50 4.17 
Е 2 uo 3 324 
ТАТ 41 з 4 4.892 || ZEK 5 | 4 4 
JEB 65 58 72 4003 | ZIH 6 | E 4.066 
JIZ 36 28 4 5.799 | ZIV 39 3 


For 192 of th 
difference was i 
in f 


peared to be m 
shows th 


for the two Sexes, and the chi s 


to be more meaningful f 


Measures of reliability. One measure of 
the reliability of judgment was obtained 
from the test-retest correlation of the num. 
ber of "Yes" responses made by $ to the 
first film Strip and to the identical stri 
Which was repeated after 5 saw all of the 
others. Гог 16 of the Sroups this measure 
was based on a 300-item "test"; 
of the groups there were о 
the "test" The Pears 


or men. 


Another measure of г 
tained by analyzing the Tesponses for each 
individual trigram. Since two different 
sequences started with the same film strip, 


eliability was ob- 


+ 149118 
we have eight groups of 24 55 each и. 
300 different trigrams twice (and yo 
of 24 Ss making two judgings a fout- 
trigrams of the short film strip). orrela- 
fold point coefficient or tetrachoric c with 
tion based on but 24 cases and mid à 
zero cell entries is too unstable to be W | 
While. On the other hand a measure icula 
change in meaningfulness of 3 p which 
item could be evidence of reliability Jess 
would be stable since it would ter 
affected by zero cell entries, Such a me uare 
of reliability was computed: a chi 5d es" 
test for a difference in number of 
responses on the two "testing sessions. p. 
Chi square test for correlated values 
Used; and, if Footnote d appears ficate? 
"Trigram" column of Table 1, it indi oY 
a significant change in meaningfulness werê 
а 48-hr. period. Most of the changes the 
increases, ie. more “Yes” responses ОЛ 
second testing than оп the first. in 
Either notation (Footnotes a or d) a 
"Trigram" column is а caution ee 8 
the potential user Since it indicates €! 
sex difference or a lack of reliability. айо" 
Comparability of the three шр 25 
Values. One immediate question is: ? the 
the task worthwhile?” One answer 


” 


the 
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question is the three overall correlations 
among the three studies, but this is a some- 
what deceptive answer. When an E wants 
to construct a list of trigrams, he rarely 
samples from the entire possible population. 
The usual practice is to select items with a 
predetermined level of meaningfulness. Ac- 
cordingly, a better answer to the question 
of the worthwhileness of this re-evaluation 
is to be found in the correlations of the 
three sets of values in the more restricted 
ranges from which an E might construct 
lists, e.g., high, medium, or low association 
values. In the present study there are 510 
trigrams with an association value between 
1% and 25% (none was found with 096) 


TABLE 4 


CORRELATIONS AND MEANS FOR EACH OF Four 
RESTRICTED RANGES AND THE ENTIRE RANGE OF 
ASSOCIATION VALUES OF THE THREE SCALES 


Means 


Range N г так "ок үт — 
ac M Archer Glaze Krueger 


1- 25% 510 .341 .590 .435 14.6 21.6 51.6 
26- 50% 561 .354 .406 .489 374 39.5 71.9 
51- 75% 440 .311 .341 .522 62.9 66.7 85.7 
76-100% 422 .163 .286 .559 873 821 93.6 

1-100% 1933" .788 .850 .790 481 503 745 


a After the elimination of Glaze's 29 two-valued items 
and Krueger's 14 two-valued items, there were 1,933 items 
for which all three studies reported a single value. 


for which Glaze and Krueger also report 
values. А Pearson product-moment correla- 
tion for these 510 items was computed be- 
tween the association values found in the 
present study and each of the other two. 
For purposes of comparison, the Glaze and 
and Krueger values were also correlated. 
Comparable correlations were computed for 
each of the other quartile intervals. These 
values, along with related means and stand- 
ard deviations, are shown in Table 4. 

As may be seen from Table 4, although 
the overall correlations are fairly high, .788 
to .850, the correlations for the restricted 
ranges are rather low, from .162 to .590, 
indicating that Glaze's and Krueger's values 
may not provide the best description of the 
meaningfulness of trigrams. 


SUMMARY 


The association values for 2,480 CVC 
trigrams were determined. Two measures 
of reliability of these values were also ob- 
tained. The association values obtained in 
the present study were correlated with those 
previously reported by Glaze anc Krueger. 
The overall correlation was fairly high, but 
for four restricted, and more useful, ranges 
the correlations were rather low. It was 
concluded that the present scale offers a 
better estimate of the meaningfulness of 
trigrams than was previously available. 
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THE INFORMATION AVAILABLE IN 
BRIEF VISUAL PRESENTATIONS' 


GEORGE SPERLING? 


Harvard. University 


ow much can be seen in a single brief 
H exposure? This is an important prob- 
lem because our normal mode of seeing 
greatly resembles a sequence of brief ex- 
розигез. Erdmann and Dodge (1898) 
showed that in reading, for example, the 
eve assimilates information only in the brief 
pauses between its quick saccadic move- 
ments. The problem of what can be seen 
in one brief exposure, however, remains 
unsolved. The difficulty is that the simple 
expedient of instructing the observer of 
a single brief exposure to report what he 
has just seen is inadequate. When complex 
stimuli consisting of a number of letters are 
tachistoscopically presented, observers enig- 
matically insist that they have seen more 
than they can remember afterwards, that 
is, report afterwards. The apparently 
simple question: "What did you see?" re- 
quires the observer to report both what he 
remembers and what he has forgotten. 


1 This paper is а condensation of a doctoral 
thesis (Sperling, 1959). For further details, 
especially on methodology, and for individual data, 
the reader is referred to the original thesis. It is a 
pleasure to acknowledge my gratitude to George А. 
Miller and Roger N. Shepard whose support made 
this research possible and to E. В. Newman, Js 
Schwartzbaum and 5. $. Stevens for their many 
helpful suggestions. Thanks are also due to Jerome 
S. Bruner for the use of his laboratory and his 
tachistoscope during his absence in the summer 
of 1957. This research was carried out under 
Contract AF 33(038) -14343 between Harvard Uni- 
versity and the Operational Applications Labo- 
ratory, Air Force Cambridge Research Center, Air 
Research Development Command. 

? Now at Bell Telephone Laboratories, Murray 
Hill, New Jersey. 

35 Some representative examples are: Bridgin 
(1933), Cattell (1883), Chapman (1930), Dallen- 
bach (1920), Erdmann and Dodge (1898), Glanville 
and Dallenbach (1929), Кире. (1904), Schumann 
(1922), Wagner (1918), Whipple (1914), Wil- 
cocks (1925), Woodworth (1938). 


The statement that more is seen than can 
be remembered implies two things. First, 
it implies a memory limit, that is, a limit 
on the (memory) report. Such a limit on 
the number of items which can be given 
in the report following any brief stimulation 
has, in fact, been generally observed; it is 
called the span of attention, apprehension, 
or immediate-memory (cf. Miller, 1956b). 
Second, to see more than is remembered 
implies that more information is available 
during, and perhaps for a short time after, 
the stimulus than can be reported. The 
considerations about available information 
are quite similar, whether the information is 
available for an hour (as it is in a book 
that is borrowed for an hour), or whether 
the information is available for only a frac- 
tion of a second (as in a stimulus which 
is exposed for only a fraction of a second). 
In either case it is quite probable that for a 
limited period of time more information 
will be available than can be reported. It 
is also true that initially, in both examples, 
the information is available to vision. 

In order to circumvent the memory limita- 
tion in determining the information that 
becomes available following a brief ex- 
posure, it is obvious that the observer must 
not be required to give a report which 
exceeds his memory span. If the number of 
letters in the stimulus exceeds his memory 
span, then he cannot give a whole report 
of all the letters. Therefore, the observer 
must be required to give only a partial re- 
port of the stimulus contents. Partial re- 
porting of available information is, of 
course, just what is required by ordinary 
schoolroom examinations and by other 
methods of sampling available information. 


An examiner can determine, even in a 
short test, approximately how much the 


N 


student knows. The length of the test is 
not so important as that the student not be 
told the test questions too far in advance. 
Similarly, an observer may be “tested” on 
what he has seen in a brief exposure of a 
complex visual stimulus. Such a test re- 
quires only a partial report. The specific 
instruction which indicates which part of 
the stimulus is to be reported is then given 
only after termination of the stimulus. On 
each trial the instruction, which calls for 
a specified part of the stimulus, is randomly 
chosen from a set of possible instructions 
which cover the whole stimulus. By re- 
peating the interrogation (sampling) pro- 
cedure many times, many different random 
samples can be obtained of an observer's 
performance on each of the various parts 
of the stimulus. The data obtained thereby 
make feasible the estimate of the 
formation that was available to the 
from which to dr. 
age trial. 


total in- 
observer 
aw his report on the aver- 


The time at which the instruction is given 
termines the time at which av. 
formation js sampled. By suitable coding, 
the instruction may be given at any time: 
before, during, or after the stimulus presen- 
tation, Not only the available information 
immediately following the termination of 
the stimulus, but a continuous function 
relating the amount of information available 
to the time of instruction may be obtained 
by such a procedure 

Many 


Which 


de ailable in- 


Studies have been 


conducted in 
observers were 


: required to give 
partial reports, that is, to report only- on 
опе aspect or one location of the stimulus, 
In prior experiments, however, the instruc- 
tions Were often not randomly Chosen, and 
the set of possible instructions did not 
Systematically Cover the stimulus. The no. 
tions of testing or sampling were not ap- 
plied." Tt is not Surprising, therefore, that 


* The experiments referred to ar f 

А сх are (с! 
1999): Ripe (1900), Месе ropes 
(1932), Long, Henneman, and Reid (1953) Long 
amd Lee (12538), Long and Lee (195319 Long 
Reid, and Garvey (1954), Law ence and Cole. 
(1054), Adams (1955), Гав aca | Ls 
(1936), Broadbent (19374). Ы 


Sperling, 
Chapman 


Laberge 
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estimates have not been made of the total 
information available to the observer fol- 
lowing a brief exposure of a complex 
stimulus. Furthermore, instructions have 
generally not been coded in such a way 2 
to make it possible to control the precise 
time at which they were presented. Cor] 
sequently, the temporal course of available | 
information could not have been quantita 
tively studied. In the absence of precise 
data, experimenters have all too frequently 
assumed that the time for which informa- | 
tion is available to the observer corresponds 
exactly to the physical stimulus duration i 
Wundt (1899) understood this -— 
and convincingly argued that, for extreme) 
short stimulus durations, the assumption | 
that stimulus duration corresponded " 
the duration for which stimulus Шш 
tion was available was blatantly false, bu | 


= available | 
he made no measurements of availa | 
information, 


The 
ducted t 


соп“ 


following PXperiments were 
S expe гта 


© study quantitatively the info ayer 
tion that becomes available to an observ". 
following a brief exposure. Lettered Aes | 
were chosen because these contain à pon 
lively large amount of information per Не 
and because these are the kind of T 
that have been used by most previous ! 


. ы c i 5 aré 
vestigators. The first two experiments at 
essentially contro] experiments; the) ory 
tempt to confirm that immediate-mem” < 


for letters is inc 
9! stimulation, 
characteristic, 


1 : ters 
lependent of the ИТ 
that it is an ipii, 
In the third experiment tel) 
number of letters available immedia el 
afta. . " A g is © 
atter the €xtinction of the stimulus iS wl 


armine, А Н ing arti 
termined by means of the sampling (ра! 


i ) phe 
report) procedure described above. ail’ 

е M rat © Me 
fourth experiment explores decay of a aft 
able Information 


with time. The ane 
nes some exposure par ue 
In the sixth experiment a techn" nt 
which fails to demonstrate a large ато й. 
of available information is 
Che seventh experiment deals with the "def 


= А: 3 Е А . ОГ 
ot the historically important variable: ° 
ot report, 


experiment exam} 
eters, 
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invest“ је 
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Apparatus. The experiments utilized a Gerbrands 
tachistoscope.® This is a two-field, mirror tachisto- 
scope (Dodge, 1907b), with a mechanical timer. 
Viewing is binocular, at a distance of about 24 
inches. Throughout the experiment, a dimly il- 
luminated fixation field was always present. 

The light source in the Gerbrands tachistoscope 
is a 4-watt fluorescent (daylight) bulb. Two such 
lamps operated in parallel light each field. The 
operation of the lamps is controlled by the micro- 
switches, the steady-state light output of the lamp 
being directly proportional to the current. How- 
ever, the phosphors used in coating the lamp con- 
tinue to emit light for some time after the cessation 
of the current. This afterglow in the lamp follows 
an exponential decay function consisting of two 
parts: the first, a blue component, which accounts 
for about 40% of the energy, decays with a time 
constant which is a small fraction of a millisecond ; 
the decay constant of the second, yellow, component 
was about 15 msec. in the lamp tested. Fig. 1 


GE 


INTENSITY-LINEAR SCALE 


50 MILLISECONDS PER DIVISION 


Гу i | 
used in most ol the experiments. 
a photograph of an oscilloscope trace) 


"E 
Arlington 74, Massachusetts. 


А 50-millisecond light flash, such as was 
(Redrawn from 


Иры Gerbrands Company, 96 Ronald Road, 


illustrates a 50-msec. light impulse on a linear 
intensity scale. The exposure time of 50 msec. 
was used in all experiments unless exposure time 
was itself a parameter. Preliminary experiments 
indicated that, with the presentations used, ex- 
posure duration was an unimportant parameter. 
Fifty msec. was sufficiently short so that eye 
movements during the exposure were rare, and 
it could conveniently be set with accuracy. 

Stimulus materials. The stimuli used in this 
experiment were lettered 5x8 cards viewed at a 
distance of 22 inches. The lettering was done 
with a Leroy No. 5 pen, producing capital letters 
about 0.45 inch high. Only the 21 consonants 
were used, to minimize the possibility of Ss inter- 
preting the arrays as words. In a few sets of cards 
the letter Y was also omitted. In all, over 500 
different stimulus cards were used. 

There was very little learning of the stimulus 
materials either by the Ss or by the E. The only 
learning that was readily apparent was on several 
stimuli that had especially striking letter com- 
binations. Except for the stimuli used for train- 
ing, no S ever was required to report the same 
part of any stimulus more than two or three times, 
and never in the same session. 

Figure 2 illustrates some typical arrays of 
letters. These arrays may be divided into several 
categories: (a) stimuli with 3, 4, 5, 6, or 7 letters 
normally spaced on a single line; (b) stimuli with 
six letters closely spaced on a single line (6- 
massed); (c) stimuli having two rows of letters 
with three letters in each row (3/3), or two rows 


LQDKKJ 


ZYVVFF 


Fic. 2. Typical stimulus materials. Col. 1: 3, 5, 


6, 6-massed. Col, 2: 3/3, 4/4, 3/3/3, 4 4,4 LEN. 
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of four letters each (4/4); (d) stimuli having 
three rows of letters with three letters in each 
row (3/3/3). The stimulus information, calculated 
in bits, for some of the more complex stimuli is 
26.4 bits (6-letters, 6-massed, 3/3), 35.1 bits (4/4), 
and 39.5 bits (3/3/3). 

Tn addition to stimuli that contained only letters, 
зоте stimuli that contained both letters and num- 
bers were used. These had eight (4/4 L&N, 
35.7 bits) and twelve symbols (4/4/4 L&N, 53.6 
bits), respectively, four in each row. Each row had 
two letters and two numbers—the positions being 
randomly chosen. The 5 was always given a sample 
Stimulus before L&N stimuli were used and told 
of the constraint above. He was also told that 
O when it occurred was the number "zero" and was 
not considered a letter. Calculated with these con- 
straints, the information in each row of four 
letters and numbers (17.9 bits) on such a card 
is nearly equal to the information in a row of four 
randomly chosen consonants (17.6 bits), even 


though there are different kinds of alternatives in 
each case. 


_ Subjects, The nature of the experiments made 
it more economical to use small numbers of trained 
Ss rather than several large groups of untrained Ss, 
Four of the five Ss in the experiment were ob- 
tained through the student employment service, 
The fifth $ (RNS) was a member of the faculty 
Who was interested in the research. Twelve ses- 
51015 were regularly scheduled for each S, three 
times weekly, 


Instructions and trial procedures. $ was in- 
structed to look at the fixation cross until it was 
clearly in focus; then he pressed a button which 

| ion after а 0.5-sec, delay. 

15 procedure constituted an approximate be- 


N of the degree of dark adaptation 
Prior to the exposure, namely, the ability to focus 


fixation cross, 


ded on a speciall 
response grid. A response р oi eec 


Squares on the response 
they were not certain, TH 


Series of 5 to 20 trials Were grouped together 
without а change in conditions, 


n a sequence of 


ponding, 
(except ND) Preferred rapid ды жш pene 


Set by the Е’ 
and instruction 


tones. This was about three to four stimuli per 
minute. i 
Each of the first four and last two sessions 
began with and/or ended with a simple taski 
the reporting of all the letters in stimuli of 3 
4, 5, and 6 letters. This procedure was under- 
taken in addition to the usual runs with these 
stimuli to determine if there were appreciable 
learning effects in these tasks during the cours 
of the experiment and if there was an accuracy 
decrement (fatigue) within individual sessions 
Very little improvement was noted after the secda 
session. This observation agrees with p ph. 
reports (Whipple, 1914), There was little di 
ence between the beginning and end of one 
Scoring and tabulation of results, Every гер 
of all Ss was scored both for total number in 
letters in the report which agreed with letters 
the stimulus and for the number of letters n 
in their correct positions. Since none of the Ther 
cedures of the experiments had an effect on we. 
of these scores independently of the other, ition 
the second of these, letters in the correct post 
is tabulated in the results, This score, which sin 
Position into account, is less subject to gues tet’ 
сггог, and in some cases it is more readily sition 
preted than a score which does not take port 0 
into account. As the maximum correction, 4/4 
guessing would be about 0.4 letter for the all 
(12-letter) material—and considerably less {07 the 
other materials—no such correction is made i? n 
treatment of the data. In general, data were 
tabulated more accurately than 0.1 letter. wert 
Data from the first and second sessions У 
not uscd if they fell below an S's average ions 
formance on these tasks in subsequent т. 
This occurred for reports of five and of, rion 
(4/4) letters for some Ss. A similar bo 
applied in later Sessions for tasks that Мне 
initiated later, In this case, the results O d ifi 
first "training" session(s) are not incorporate 02 
the total tabulation if they lie more ther al 
letter from S's average in subsequent 505510: 


e = ЕР: ПОИНИ: 
BRSS SB Зея 


Sends EX den e 
cand в gp аа а, 


When an S is required to give a comple, 
(whole) report of all the letters on a brich? 
exposed stimulus, he will generally not 
— 


„he 
. rs in 
ai E there are à large number of letters neers 
"E us, the Probability that these same 16; #2 
will appear somewhere on the response grid, ^. of 


Spective of Position, becomes very high whet 16 jus 
not S has much 


In the limit, the 


Experiment 1: ЛИН | 


information about the 28707 
: correspondence approach ip 
Provided only that the а fede of y ( 
letter in the response matches its relative e the 
quency of occurrence in the stimulus pack. det 
Tesponse is scored for both letter and positio? tid 
the Percent guessing correction is indepen 
changes in stimulus size. 
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port all the letters correctly. The average 
number of letters which he does report 
correctly is usually called his immediate- 
memory span or span of apprehension for 
that particular stimulus material under the 
stated observation conditions. Ап expres- 
sion such as immediate-memory span 
(Miller, 1956a) implies that the number 
of items reported by S remains invariant 
with changes in stimulating conditions. The 
present experiment seeks to determine to 
what extent the span of immediate-memory 
is independent of the number and spatial 
arrangement of letters, and of letters and 
numbers on stimulus cards. If this inde- 
pendence is demonstrated, then the quali- 
fication "for that particular stimulus 
material” may be dropped from the term 
immediate-memory span when it is used 
in these experiments. 

Procedure, Ss were instructed to write all the 
letters in the stimulus, guessing when they were 
not certain. All 12 types of stimulus materials 
were used. At least 15 trials were conducted with 
each kind of stimulus with each 5. Each S was 
given at least 50 trials with the 3/3 (6-letter) 
Stimuli which had yielded the highest memory 
Span in preliminary experiments. The final run 
made with any kind of stimulus was always а test 
ОЁ immediate-memory. This procedure insured that 
Ss were tested for memory when they were 
maximally experienced with a stimulus. | 

Results. The lower curves in Fig. 3 
represent the average number of letters 
correctly reported by each 5 for each ma- 
terial." The most striking result is that 
immediate-memory is constant for each S, 
being nearly independent of the kind of 
stimulus used. The immediate-memory span 
for individual Ss ranges from approxi- 
mately 3.8 for JC to approximately 5.2 for 
NJ with an average immediate-memory 
span for all Ss of about 4.3 letters. (The 
upper curves are discussed later.) m 

The constancy which is characteristic of 
individual immediate-memory curves of 
Fig. 3 also appears in the average curve for 
all Ss. For example, three kinds of stimuli 
were used that had six letters each: six 
letters normally spaced on one line, 6- 


En 

т Gee Sperling (1959) for tables giving the 
numerical values of all points appearing in this and 
in all other figures. 


massed, and 3/3-letters (see Fig. 2). When 
the data for all Ss are pooled, the scores 
for each of these three types of materials 
are practically the same: the range is 4.14.3 
letters. The same constancy holds for 
stimuli containing eight symbols. The aver- 
age number of letters correctly reported for 
each of the two different kinds of eight 
letter stimuli, 4/4, 4/4 L&N, is nearly the 
same : 4.4, 4.3, respectively. 

Most Ss felt that stimuli containing both 
letters and numbers were more difficult 
than those containing letters only. Never- 
theless, only NJ showed an objective deficit 
for the mixed material. 

In conclusion, the average number of 
correct letters contained in an S’s whole 
report of the stimulus is approximately 


AVERAGE. 
ALL SUBJECTS 


LETTERS CORRECTLY REPORTED 
m ^ o o б в 


а, дате аа 
3456 831 в $456 83910 i? 
NUMBER OF LETTERS IN STIMULUS 


Fic, 3. "Channel capacity curves.” Immediate- 
memory and letters available (output information) 
as functions of the number of stimulus letters 
(input information). Lower curves — immediate- 
memory (Exp. 1); upper curves — letters available 
immediately after termination of the stimulus; 
diagonal lines = maximum possible score (ie, 
input = output). Code: X = letters on one line; 
+ = 6-massed; o = 3/3, 4/4, 5/5; А = 3/3/3; 
О = 4/4 L&N, 4/4/4 L&N. 
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equal to the smaller of (a) the number of 
letters in the stimulus or (b) a numerical 
constant—the span of immediate-memory— 
which is different for each 5. The use of 
the term immediate-memory span is there- 
fore justified within the range of materials 
studied. This limit on the number of letters 
that can be correctly reported is an individ- 
ual characteristic, but it is relatively similar 
for each of the five Ss of the study. 


Experiment 2: Exposure Duration 


The results of E <periment 1 showed that, 
regardless of material, Ss could not report 
more than an average of about 4.5 items 
per stimulus exposure. In order to deter- 
mine whether this limitation was a peculiar 
characteristic of the short exposure dura- 
tion (0.05 sec.), it was necessary to vary 
the exposure duration, 

Procedure, Аз in the 
Were instructed 
stimulus. The sti 
NJ, who was abl 
letters in Experi 
order to make a Possible increment in his accuracy 
The Ss were given 10 
conditions, .015-, .050-, 
-5ес. exposure duration, in the 
order above, In à later Session, additional trials 
Were conducted at 015-sec, exposure as a control 


for Experiment 5. 7 i 
E 5. These trials are averaged wi 
the above data. й E 


Results and 


; ‹500. 


Figure 4 illus- 
TS correctly re- 
the duration of 
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Fic. 4. Letters correctly Feported as a function 
of exposure duration. 


exposure. The main result is that exposure 


i i T 1 n 
duration, even over a wide range, is not an | 


important parameter in determining the 
number of letters an .$ can recall correctly. 
Both individually and as a group, Ss de 
no systematic changes іп .he number 0 
letters. correctly reported as the exposure 
duration was varied from 0.015 to 0.500 sec | 
The invariance of the number of letters 
reported as a function of e porun e. 
tions up to about 0.25 sec. for the kin E 
presentation used (dark pre- and a 
exposure fields) has been knov 
(Schumann, 1904). 


long 
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Experiment 3: Partial Report 


the span of immediate-memory as an i 
variant characteristic of cach 5. Та Exper Й 
ment 3 the principles of testing 3 in 
perceptual situation that were advance 
the introduction are applied in order 


. strated 
periments 1 and 2 have demonstrate" | 


г А jon | 
determine whether $ has more informat 
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available than he can indicate in his lim 


г ) i 
immediate-memory 


report. 
The 5 


А B Б ce a 
before, but he is required only to mak 


partial report. The length of this a 
is four letters or less, so as to lic d 
S's immediate-memory span. The yin 
tion that indicates which row of the sumo. 
is to be reported is coded in the form né 
tone. The instruction tone is given after t w 
visual presentation. The S does not Kno 
until he hears the tone which row is а 
for. This is therefore а procedure paar 
Samples the information that S has ve 
able after the termination of the vist 
Stimulus, 
Proced ure, 
only two lines 


" ы javit 
Initially, Stimulus materials prn 
S was told th 


Were used, that is, 3/3 and 4/4 -pjs 
at a tone would be sounded, that 155 
tone would come approximately simultaneo” 4 
with the exposure, and that it would be eithe ч 
high tone (2500 cps) or a low tone (250 E ne 
tone, he was to write only nly 
Г row of the stimulus; if a low tone card 


Ver TOW. He was ther shown a sample “ow 
3/3 letters and ie 


given several high anc 
tones, It was Suggested that he keep his pe 
fixated on the fixation point and be equally et 
Pared for either tone. It would not be poss! 100" 
Outguess the Е who would be using à d 
sequence of tones, 


А ^ " as 
15 presented with the stimulus Û 
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The tone duration was approximately 0.5 sec. 
The onset of the tone was controlled through 
the same microswitch that controlled the off-go of 
the light, with the completion of a connection irom 
ап audio-oscillator to the speaker. Intensity of the 
tone was adjusted so that the high (louder) tone 
was "loud but not uncomfortable." 

In each of the first two sessions, each S re- 
ceived 30 training trials with each of the materials 
3/3, 4/4. In subsequent sessions Ss were given 
series of 10 or more "test" trials. Later, a third, 
middle (650 cps) tone was introduced to cor- 
respond to the middle row оЁ the 3/3/3 and 4/4/4 
stimuli. The instructions and procedure were es- 
sentially the same as before. 

In any given session, each tone might not occur 
with equal frequency for each type of stimulus. 
Over several sessions, usually two, this unequal 
frequency was balanced out so that an S had an 
exactly equal number of high, medium, and low 
tones for each material If an 5 “misinterpreted” 
the tone and wrote the wrong row, he was asked 
to write what he could remember of the correct 
row. Only those letters which corresponded to the 
row indicated by the tone were considered. 


Treatment of the Data. In the experiments 
considered in this section, S is never required to 
report the whole stimulus but only one line of a 
possible two or three lines. The simplest treatment 
is to plot the percentage of letters correct. This, 
in fact, will be done for all later comparisons. 
"The present problem is to find a reasonable meas- 
ure to enable comparison between the partial 
report and the immediate-memory data for the same 
stimuli, The measure, percent correct, does not 
describe the results of the immediate-memory 
experiments parsimoniously. In Experiment 1 it 
was shown that Ss report a constant number of 
letters, rather than a constant percentage of 
letters in the stimulus. The measure, number of 
letters correct, is inappropriate to the partial re- 
port data because the number of letters which 5 
reports is limited by the E to at most three or 
four, The most reasonable procedure is to treat 
the partial report as a random sample of the 
letters which the S has available. Each partial 
report represents а typical sample of the number 
of letters S has available for report. For ex- 
ample, if an S is correct about 90% of the time 
when he is reporting three out of nine letters, then 
he is said to have 90% of the nine letters—about 
eight letters—available for partial report at the 
time the instruction tone is given. 

In order to calculate the number of available 
letters, the average number of letters correct in 
the partial report is multiplied by the number of 
equiprobable (nonoverlapping), partial reports. If 
there are two tones and two rows, multiplication 
is bv 2.0; if three, by 3.0. As before, only the 


number of correct letters in the correct position is 


considered. 


Results. The development of the final, 
stable form of the behavior is relatively 
rapid for Ss giving partial reports. The 
average for all Ss after 30 trials (first 
session) with the 3/3 stimuli was 4.5; on 
the second day the average of 30 more 
trials was 5.1. On the third day Ss aver- 
aged 5.6 out of a possible six letters. Most 
of the improvement was due to just one 5: 
ND who improved from 2.9 to 5.8 letters 
available. In the 3/3/3 stimulus training, 
all Ss reached their final value after the 
initial 40 trials on the first day of training. 
The considerable experience Ss had ac- 
quired with the partial reporting procedure 
at this time may account for the quick 
stabilization. NJ, whose score was 7.7 
letters available on the first 20 trials, was 
given almost 150 additional trials in an 
unsuccessful attempt to raise this initial 
core. 


In Fig. 3 the number of letters available 
as a function of the number of letters in 
the stimulus are graphed as the upper 
curves. For all stimuli and for all Ss, the 
available information calculated from the 
partial report is greater than that contained 
in the immediate-memory report. Moreover, 
from the divergence of the two curves it 
seems certain that, if still more complex 
stimuli were available, the amount of avail- 
able information would continue to increase. 


us 


The estimate above is only a lower bound 
on the number of letters that 5$ have avail- 
able for report after the termination of the 
stimulus. An upper bound cannot be ob- 
tained from experiments utilizing partial 
reports, since it may always be argued that, 
with slightly changed conditions, an im- 
proved performance might result. Even the 
lower-bound measurement of the average 
available information, however, is twice 
as great as the immediate-memory span. 
The immediate-memory span for the 4/4/4 
(12-letters and numbers) stimuli ranges 
irom 3.9 to 4.7 symbols for the Ss. with 
an average of 4.3. Immediately after ап 
exposure of the 4/4/4 stimulus material, 
the number of letters available to the S's 
ranged from 8.1 (ND) to 11.0 (ROR), 
with an average of 9.1 letters available. 
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This number of letters may be transformed 
into the bits of information represented by 
so many letters. For the 4/4/4 (12-letters 
and numbers) material, the average number 
of bits available, then, is 40.6, with a range 
from 36.2 to 49.1 (out of a possible 53.6 
bits). These figures are considerably higher 
than the usual estimates. For example, in 
а recent review article Quastler (1956) 
writes: "All experimental studies agree 
that man can . . . assimilate less than 25 
bits per glance" (p. 32). The data obtained 
in Experiment 3 not only exceed this maxi- 
mum, but they contain no evidence that the 
information that became available to the 
Ss following the exposure represented a 
limit of "man" rather than a maximum 
determined by the limited information con- 
tained in the stimuli which were used. 


Experiment 4: Decay of Available 
Information 


Part 1: Development of Strategies of 
Observing 


It was established in Experiment 3 that 
more information is available to the Ss 
immediately after termination of the 
stimulus than they could report. It remains 
to determine the fate of this surplus in- 
formation, that is, the “forgetting curve.” 
The partial report technique makes possible 
the sampling of the available information at 
the time the instruction signal is given. By 
delaying the instruction, therefore, decay of 
the available information as a function of 
time will be reflected as а corresponding 
decrease in the accuracy of the report. : 


given five or 
е above 


randomly. The advantages of this procedure а 
(a) optimal performance is most likely in each 
delay condition, if S is prepared for that precise 
condition (cf. Klemmer, 1957), (b) minimizing 
delay changes makes possible a higher rate © 
stimulus presentations. On the other hand, d 
random sequence of instruction tone delays wou 
make it more likely that S was “doing the same 
thing" in each of the different delay conditions. 

The sequence in which the different delay con- 
ditions followed each other was chosen either E. 
that given above (ascending series of delay pe 
ditions) or in the reverse order (descen Es 
series). Within a session, a descending d 
always followed an ascending series and vice versa, 
irrespective of the stimulus materials used. E. 
least two ascending and two descending series au 
delay conditions were run with each S and ke 
each material after the initial training (Ехре " 
ment 3) with that material. This number of lier 
insures that for each S there are at least 20 tr!? 
at each delay of the indicator tone. 


Results and discussion. The developmen 
of the typical behavior is illustrated by uw 
5, ROR, in Figs. 5a, b, с. Figure 5a - 
ROR's performance in a single session, t! 
first posttraining session. The upper ай 
lower curves represent the ascending 2” 
descending series of tone instruction delays 
respectively. The arrows indicate the i, 
Although each point is based upon ОП 7 
five trials, the curves are remarkably simil 
and regular. Clearly, most of the и. 
in excess of ROR's memory span are Тогу 
gotten within about 0.25 sec. The rapid ° A 
getting of these letters justifies calling this 
short-term memory and accounts for E 
tact that it may easily be overlooked unde? 
less than optimal conditions. In the m 
ing session (Fig. 5b) the descending Sef 
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was given first. Here orderly behavior dis- 
integrates. In the third session (Fig. 5c) 
two modifications were introduced: (a) the 
number of trials in each delay condition was 
increased to eight and (b) а new delay con- 
dition was given—namely, a signal tone com- 
ing 0.05 sec. before the stimulus onset. The 
curves are again regular, but they are ob- 
viously different for the ascending and de- 
scending series. l'or the session indicated in 
Fig. 5c, an analysis of the errors by position 
shows that in the ascending series the errors 
are evenly split between the top and bottom 
rows of the stimulus; in the descending 
series, the top row is favored 3:1. . 
ROR's performance is analyzable in 
terms of two kinds of observing behavior 
(strategies) which the situation. suggests. 
He may follow the instruction, given by B 
prior to training, that he pay equal attention 
to each row. In this case, errors are evenly 
distributed between rows. Or, he may try 
to anticipate the signal by guessing which 
instruction tone will be presented. In this 
case, S is differentially prepared to report 
one row. If the signal and S’s guess com- 
cide, 5 reports accurately; if not, poorly. 
Such a guessing procedure would lead to 
the variability observed in Fig. 5b. On the 
other hand, S may prefer always to antici- 
pate the same row—in the case of ROR 
(Fig. 5с, descending series), the top row. 
This would again allow reliable scores, pro- 
vided only that there are an equal number 
of instruction signals calling for the top and 
bottom rows. Concomitantly, а differential 
accuracy of report for the two rows 1S 
observed. (ROR's preference for the top 
row is again prominent in Experiment 7, 
Figs. 10, 11.) MS. 
Equal attention responding 1$ initially 
reinstated on the third day. The obvious 
change in procedure which is responsible 
is the introduction of a tone 0.05 sec. before 
the stimulus onset (—0.10 sec. "after" its 
termination). This signal is sufficiently in 
advance of the stimulus so that perfect 
responding is possible by looking at only 
the row indicated by the signal tone. ROR 
scores 100%, both in this condition and in 
the succeeding zero delay. The whole 
(ascending series) decay curve of Fig. 5c 


is highly similar to that of Fig. 5a. A run 
with 3/3 stimuli was interposed between 
the ascending and descending series shown 
in Fig. 5c. Since the guessing procedures 
were easily sufficient for a nearly perfect 
score with 3/3 materials, when the descend- 
ing series of delay conditions was run, ROR 
continued guessing. While guessing was 
advantageous at the long delays, at the short 
delays it was a decided disadvantage. 
Figure 6 illustrates the performances of 
RNS, a sophisticated observer. RNS de- 
scribed the two strategies (equal attention, 
guessing) to E. In accordance with the 
instructions to do as well as he could, RNS 
said that he switched from the first to the 
second strategy at delays longer than 0.15 
sec. Thus in the three short delay con- 
ditions, RNS divided his errors almost 
evenly (19:21) between the favored (top) 
and the unfavored rows; in the two longer 
delays, errors were split 4:26. The dip in 
the curve indicates that RNS did not switch 
strategies quite as soon as he could have, 
{ог optimal performance. Such а dip is 
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characteristically seen in experiments of this 
kind. | 

The other Ss exhibit similar curves.’ 
These are not presented, as the main fea- 
tures have already been demonstrated by 
ROR and RNS. In summary, the method 
of delaying the instruction tone is a feasible 
one for determining the decay of the short- 
term memory contents, but experience with 
the difficult, long delays causes a consider- 
able increase in the variability of Ss' per- 
formance which is carried over even to the 
short delay conditions. This has been at- 
tributed to Ss’ change from an equal atten- 


tion toa guessing Strategy in observing the 
stimulus. 


Part 2: Final Level of Performance 


The analysis of the Preceding experiment 
has indicated that two distinct kinds of 


observing behavior develop when the in- 
struction to report is delayed. The accuracy 
of report resulting from the first of these 
i is correlated 
he instruction tone; it is 
€ Ss initially giving equal 
arts of the stimulus, The 
accuracy of the other kind of report (guess- 

ated with the delay of the 
it is characterized by Ss’ differ- 
me part of the 
Equal attention ob- 
further study 


instruction : 
ential prepa 
stimulus ( 


essing, 
1. The use ‹ 
of letters, that is, 3/3/3 and 4/4/4, Dif- 


„` See Sperling (1959) for 
vidual average 


tables Containing indi. 
delay of instru 


5 of all trials for each $ 
ction tone : 3/3, 4/4. 
? Increase in Variability (ay 

ment in accuracy and/or pes eo dor- 
after difficult conditions, 
Thomas (1957) in a cli 
reported an exactly 
nomenon in a study of 
Hysteresis refers to Н 


at each 


tha larger number 


ferential attention to a constant small port 
of the stimulus is less likely to be а 
forced, the larger the stimulus. 1 he npe 
three tones instead of two ыы 
probability of guessing the correct өөү | 
2. Training with instruction age A 
begin slightly before the onset ee 
stimulus. [t is not necessary for 5 to J ж 
in this situation since he can я lo. 
depending upon the instruction tone a E 
This situation not only makes equal Lid 
likely to occur, but differentially reinforce 
it when it does occur. - 
When delays of longer length are eos 
priority should be given to an Been Бе 
series of delays so that $ will, at = pe 
ginning of a particular delay капарга ue 
a high probability of entering with ИУ 
sired observing behavior. This. pape 
might be nearly 1.0 by interposing 8 im 
of trials on which the instruction is Senis 
in advance of, or immediately upon, a 
nation of the stimulus and requiring: tha ie 
perform perfectly on this task E к 
can continue to the particular delay bw. 
tested.!?^ This tedious procedure was pow 
but, as it did not have an appreciable тив 
upon the results, it was discontinued. 
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б "NE et veel 
problem 15 that Ss learn to switch m 
the two mod 
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3. The Е may be able to à 
control over Ss" modes of responding. 
itiaily, howey even S cannot control 
own behavior actly. This suggest a 
limit to what Е can do. For example, h 
quently 55 reported that, although they -0Wi 
tried to þe equally prepared for each 10 


2 o | 1 
€$ of behavior in a small 


„рай 
vain verba 
^ Jn- 
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Some verbal control is, of course, pos- 
sible. An instruction that was well under- 
stood was: 

You will see letters illuminated by a flash that 

quickly fades out. This is a visual test of your 
ability to read letters under these conditions, not 
а test of your memory. You will hear а tone 
during the flash or while it is fading which will 
indicate which letters you are to attempt to read. 
Do not read the card until you hear the tone, 
[etc.]. 
The instruction was changed at the midway 
point in the experiment. The S was no 
longer to do as well as he could by any 
means, but was limited to the procedure 
described above. Part 2 of this experiment, 
utilizing 9- and 12-letter stimuli, was carried 
out with the three modifications suggested 
above. 

Results. The results for 3/3/3 and 4/4/4 
letters and numbers are shown for cach 
individual S in Figs. 7 and 8. The two 
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ordinates are linearly related bv the equa- 
tion: 
percent correct 
100 


no. letters in stimulus — letters available 


Each point is based on all the test trials in 
the delay condition. The points at zero 
delay of instructions for NJ and JC also 
include the training trials, as these Ss 
showed no subsequent improvement. 


The data indicate that, for all Ss, the 
period of about one sec. is a critical one 
for the presentation of the instruction to 
report. If Ss receive the instruction 0.05 
sec. before the exposure, then they give 
accurate reports: 91% and 82% of the 
letters given in the report are correct for 
the 9- and 12-letter materials, respectively. 
These partial reports may be interpreted to 
indicate that the Ss have, on the average, 
8.2 of 9 and 9.8 of 12 letters available. 
However, if the instruction is delayed until 
one sec. after the exposure, then the ac- 
curacy of the report drops 3262 (to 69%) 
for the 9-letter stimuli, and 4456 (to 38%) 
for the 12-letter stimuli. This substantial 
decline in accuracy brings the number of 
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Fic. 8. Decay of available information: twelve 
(4/4/4) letters and numbers. Light fash is shown 
on same time scale at lower left. Dar at right 
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12 


letters available very near to the number of 
letters that Ss give in immediate-memory 
(whole) reports. 


The decay curves are similar and regular 
for each S and for the average of all Ss. 
Although individual differences are readily 
apparent, they are small relative to the 
effects of the delay of the instruction. For 
example, when an instruction was given 
with zero delay after the termination of 
the stimulus, the least accurate reports by 
any Ss are given by ND, who has 8.1 letters 
available immediately after the termination 
of the stimulus. With a one-sec. delay of 
instructions, the most accurate reports were 
given by JC, who has only 5.1 letters avail- 
able at this time. 

In Fig. 3, in which whole reports and 
partial reports were compared, only that 
particular partial report was considered in 
Which the instruction tone followed the 
stimulus with zero delay. Tt is evident from 
this experiment that the zero delay instruc- 
tion is unique only in that it is the carliest 
possible “after” instruction, but not because 


of any functional difference, 


therefore, the 0.15-, 0.50-, and 
ауз are also plotted 
for the six- (3/3) 


In Fig. 9, 
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for one $ ; ROR. Data 
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and eight- (4/4) letter stimuli are taken 
from the two ascending delay series with 
each material that yielded monotonic re- 
sults. Figure 9 clearly highlights the sig- 
nificance of a precisely controlled coded 
instruction, given within a second of E 
stimulus off-go, for the comparison 0 
partial and immediate-memory reports. One 
second aíter termination of the stimulus, 
the accuracy of ROR’s partial reports is E 
longer very different from the accuracy 0 
his whole reports. 


s z s н $ 
Experiment 5: Some Exposure Parameter 


In Experiment 3 it was shown that ш 
number of letters reported correctly i$ E 
most independent of the exposure durat 
over a range from 15 to 500 msec. Tt x 
well known, however, that the relation be 
tween the accuracy of report and the ее. 
posure duration depends upon the рге- ап 
post-exposure fields (Wundt, 1899). 15 

In a technique developed in Helmholtz 
laboratory (Baxt, 1871) the information? 
(stimulus) field is followed, after а vari B 
delay, by a noninformational, һотодепео 
bright postexposure field. Using thi 
method, Baxt showed that the number E. 
reportable letters was а nearly linear t 
tion of the delay of the bright роѕі-ехроѕи! 
field: t- 

Other combinations of pre- and pos 
exposure fields have also been tried (Dodge 
1907a). The usual tachistoscopic presenta” 
tion utilizes gray pre- and роѕі-ехроѕш* 
fields (Woodworth, 1938). Baxt's Pro" 
cedure, however, {5 the most disadva? 
tageous for the observer, A simila 
procedure was therefore selected, in ord 
TB im port: ibed by 

rta cr: 
Ladd (1899) a Tames Е exited 


ut it is no longer well known. Consequent 1¢ 
has been “rediscovered » most recently, by Lin eat 
and Emmons (1958). ' 
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to study whole and partial reports in a 
vastly different visual presentation from 
that of the previous experiments. 

Procedure. 1. Ss were instructed to write all 
the letters of the stimulus; 3/3 stimuli were used. 
After several sample presentations of a stimulus 
card followed by a light post-exposure field, 5$ 
were given a random sequence of normal (pre- 
and post-exposure fields dark) and Baxt (pre- 
exposure dark, post-exposure field light) trials. 
The Baxt trials do not correspond exactly to the 
Presentation that Baxt used. In this experiment, 
the post-exposure field is the same intensity as 
the stimulus (informational) field, whereas Baxt 
usually used more intense post-exposure fields; 
also, the stimulus field always remains on until 
the onset of the post-exposure field, whereas Baxt 
used a fixed five-msec. duration for the stimulus 
field, The post-exposure field itself remains on 
for about one sec. The pre-exposure field is 
always dark, as in all the previous experiments. 
Two exposure durations, 0.015 and 0.050 sec., were 
tested. 

2. Three Ss were tested with the Baxt presenta- 
tion of a 3/3/3 stimulus at an exposure duration 
of 0.015 sec. The partial report procedure was 
used to determine the effects of the post-exposure 
field on the number of letters available. 


3. The same three Ss were run as their own 
controls, The procedure was exactly the same as 
in Paragraph 2 above except that the post-exposure 
field was normal (dark). 

Results. The complete results are given 
in Tables 1 and 2. In all tests, the Baxt 
procedure reduces the response accuracy of 
all Ss to about one-half of their normal 
score. This finding confirms the earlier 
studies. However, a linear relation between 
exposure duration and the number of letters 
reported was not observed. The failure to 
find a linear relation may be due to the 
previously mentioned differences between 
the presentations. 

Tor RNS and ND, the number of letters 
available is nearly the same (about two) 
in the partial report of 9-letter stimuli as 
the whole report of 6-letter stimuli. The 
fact that in both procedures the number of 
letters given by Ss is the same suggests 
that a Baxt presentation reduces the num- 
ber, or the length of time that letters are 
available, and that it does not directly affect 
the immediate-memory span. 

ROR's partial reports of Baxt presenta- 
tions are considerably more accurate than 
those of the other Ss, although they are not 


TABLE 1 


А COMPARISON OF RESPONSE ACCURACY WITH 
Two DIFFERENT Posr-ExPoscRE FIELDS 


Subject Exposure (sec.) Normal | Baxt 
RNS (0.015) 3.9 2.5 
(0.050) 4.0 9.2 

ROR (0.013) 4.8 3.5 
(0.050) 4.4 2.8 

ND (0.015) 3.8 1.9 
(0.050) 3.7 2.3 

NJ (0.015) 5.1 2.4 
(0.050) 5.4 3.4 

Jc (0.015) 4.1 Dai 
(0.050) 3.8 3.4 

Mean (0.015) 4.3 2.6 
(0.050) 4.3 2.8 


Note.—Number of letters correctly reported, Whole report 
of six (3/3) letter stimuli. Normal = pre- and post-exposure 
fields dark; Baxt = pre-exposure field dark, post-exposure 
field white. 


as accurate as his reports in control pre- 
sentations. ROR seemed to show improve- 
ment on successive Baxt trials. JC, another 
S who seemed to show improvement, was 
given additional Baxt trials on which he 
continued to improve slowly. Unfortunately, 
it was unfeasible to determine the asymp- 
totic performance of these two Ss. Whether 
the difference in performance between 
ROR, and RNS and ND is attributable to 
Some overt response, such as squinting or 
blinking, was not determined. 

Table 2 also enables the comparison of 
0.015- and 0.050-sec. exposures of 3/3/3 
stimuli. A decrease in exposure duration 
has only a slight effect on the number of 
letters available. This suggests, as in the 
immediate-memory experiments, that the 
duration of a tachistoscopic exposure is 
not as important a determinant of the num- 
ber of letters available as the fields which 
follow the exposure. 


Experiment 6: Letters and Numbers 


In Experiment 3 partial reports were 
found to be uniformly more accurate than 
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TABLE 2 


А м ES 5 yw wo DIFFERE Posr-ExPosuRE FIELDS 
7 1 FFERENT Post-Expos 
COMPARISON OF RESPONSE Accuracy wirH T DIFFER 


0.15 
Delay of Instruction (sec.): 0.0 
0.015 0.05 0.015 0.05 
Exposure Duration (sec.): .015 
Subject 
5 6.6 
3.7 5.4 
RNS (N) 8.0 8.7 Y 
(B) 2.0 е 
ROR (№) 8.6 8.9 gs 
(B) 6.3 " 
5 5.8 5.4 
М 1:8 7.0 5.8 
"d (8) 2.2 “7 
š 7.2 
Mean (N) 8.0 8.2 ma 
(B) 3.5 ө А 
stimuli: nd 
Е n SEE BEER berat in stimulus), Stim” Fel 
взу Taaperot Е о р ан ЕЕ 
white). 


Whole reports, In 
letters were used 
letters was report 
to be reported by 
of a number of po: 
ing experiment, 


опе case, stimuli of eight 
and only one row of four 


Partial report agai 
symbols, but thes 
letters or as 
In addition, 


ducted, 
Procedure, 1, 


hear the lon 
tones were th f ving 
the stimulus off-go, The stimuli were 4/4 LEN. 
5 » tests were con- 
Structions : 

given well т ad- 


S to write only the letters in the 


following card(s), (8 trials) 
2. Numbers only—Write only 
the following card(s). (8 trials) 
3. Top only—Write only 
following card(s). (4 trials) 


the numbers in 


the top row in the 


row 

4. Bottom only—Write only the bottom 

in the following card(s). (4 trials) р num- 
5. Instruction tone— Write either jeter s scc 

bers as indicated by tone, Tone onset rag also 

before stimulus onset. (16 trials) ROR Ки 

given additional trials at longer delay tim 


5 


аш 
Results. The results are illustrated 
Table 3. Гог purposes of -ÉÁ—Ü 
number of correct letters is multipli vhich 
two when an instruction was used weigh 
required S to report only four of Judes 
symbols of the stimulus, This inc 


the 
instructions given well in advance О 
stimulus. АП Measures, then, have “thi 
the top score and 


are thus equivalent rure 
Percent correct Hal 
he range is 0-8 instead of 0-100. 9€ ar 
which аге based on partial reports rti 
therefore directly comparable to the P. i 
report scores (letters available) obtainc 

Experiments 3, 4, and 5, um 
When stimuli consist of letters and p 
bers, but Ss Teport only the letters oF or 
the numbers, then the Ss’ partial MT 
are only negligibly more accurate than tepe 
Whole reports of the same stimuli. cal 
average number of letters available t И 
culated from the partial report) is jS ай 


letter above the immediate-memory 


a scale factor to 


© 
al 
in 
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TABLE. 3 


COMPARISON OF FIVE PROCEDURES 


Letters Numbers Average Instr. tone Immediate- One row 
Subject only only L&N —0.10 memory only 
RNS 5.0 ®. 4.8 4.3 4.6 
ROR 6.5 6.5 6.5 6.3 4.5 
ND 3.5 3.8 3.6 4..1 4.1 
NJ 4.0 5.0 4.5 1.6 4.3 = 
Е 3:3 4.0 3.6 3.4 4.1 8.0 
Mean 4.5 4.5 4.6 4.5 4.3 = 
Note.—Average letters and/or numbers available (fraction of letters—numbers—correct in partial report X number of symbols 
in stimulus), Stimuli: eight (4/4) letters and numbers. 
series, although checks with other Ss con- 


for the same material. Гог practical pur- 
poses, the partial report score is the same 
score that Ss would obtain if they wrote 
all the letters and numbers they could (that 
is, gave a whole report) but were scored 
only for letters or only for numbers, in- 
dependently by the experimenter. The par- 
tial report of letters only (or of numbers 
only) does not improve even when the in- 
struction is given long in advance verbally 
instead of immediately before the exposure 
by a coded signal tone. 

The estimate of the number of available 
letters and numbers which is obtained from 
the partial report of letters (or numbers) 
only is also the same as the estimate that 
would be obtained if, on each trial, Ss wrote 
only one row—either the top or the bottom 
—according to their whim. Reporting only 
one row of four letters and numbers is a 
task at which the Ss succeed with over 90% 
accuracy. Even if they are scored for the 
whole stimulus, by arbitrarily reporting 
only one row they would still achieve a 
score of almost 50% correct or almost four 
letters available. This is why no delay 
series were conducted. If Ss had ignored 
the instruction to write only the letters (or 
numbers) and had written only a single 
row on each trial, they would have shown 
less than a 0.5 letter decrement, no matter 
what the delay of the instruction. 

Only ROR showed a substantial improve- 
ment when reporting only the numbers (or 
letters). He was the only S with whom it 
made sense to conduct a systematic delay 


firmed this conclusion. Table 4 indicates 
that two extra symbols are available to 
ROR for report only when the tone is given 
before the stimulus, but not if it is given 
immediately after. It should be noted that 
the information in the instruction tone 
comes only after it has been on for 0.05 sec. 
At this time it either continues or is termi- 
nated. The actual "instruction" is thus 
given 0.05 sec. after the tone onset. ROR 
therefore requires that the instruction be 
given within 0.05 sec. of the stimulus 
termination if any benefit of the partial 
report procedure is to be retained. 

Whether the Ss would have shown im- 
provement with a large amount of addi- 
tional training in the partial report of letters 
or numbers cannot be stated. Table 3 shows 
that, when Ss are required in advance to 
report only one row, this task is trivial. The 
substantial advantage of partial reports of 
rows (report by position) over partial re- 


TABLE 4 


PARTIAL REPORTS OF LETTERS OR NUMBERS 


| 
Prior | Delay of ud Tone | Immedi- 


Subj. | Verbal| |. 


v (sec.) ate 
| Instr. 10.10 | оо |-0.25 Memory 
— f. — — К = 
| | | | 
ROR | 6.5 6.3 | 4 4.4 | 4.8 
| 


verage of symbols available (fraction of letters— 
rect in partial report X number of symbols in 
Stimuli: eight (1/4) letters and numbers. 
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ports of numbers or letters (report by 
category) when the instruction is given 
verbally long in advance of the exposure 
is retained even when the instruction is 
coded and given shortly after the exposure. 

The failure in Experiment 6 to detect a 
substantial difference in accuracy between 
partial reports of only letters (or only 
numbers) and whole reports clearly illus- 
trates that partial reports by position are 
more effective for studying the capacity of 
short-term information storage than partial 
reports by category. 


Experiment 7: Order of Report 


Interpretations of the effects of instruc- 
tions upon the report following a single 
brief visual exposure have often been con- 
cerned with either the perceptual sensitizing 
effects of an instruction given before the 
exposure or with the importance of for- 
getting between the exposure and а post- 
exposure instruction to report. The decay 
curves of Experiment 4 include both of 
these effects. Previous studies, however, 
have usually assumed the order in which 
the various parts (aspects, dimensions, etc.) 
of the stimulus are reported to be the sig- 
nificant correlate of post-exposure for- 
getting. The Possibility that information 
might be well retained even though not 
immediately reported has been mentioned 
(Broadbent, 1958), but experimental in- 
vestigations of such an effect by an inde- 


pendent variation of the order of report 
by Wilcocks (1925), Lawrence and Laberge 
(1956), and Broadbent (1957a) have ap- 


parently shown otherwise. Broadbent 
(1957a) has also shown a case in which 
independent variation of the order of report 
did not reduce overall response accuracy, 
In the present experiment, ord 
port is introduced as a purely “nuisance” 
variable for the S, The 5 is instructed to 
get as many letters correct as Possible, but 
the E randomly manipulates the order in 
which they are to be reported, 
ment is а survey of how Ss а 
kind of interference with the n 
of their report. 


er of re- 


ormal order 


ê. Ê "s were instructed to write the 
Ese on CE tone (high, low) first, чп 
to write the other row. They were to try to ge 
as many total letters correct as possible, it hs 
of no “importance in which particular row E 
correct letters might be. The instruction tone was 
given with 0.0-, .30- (or .50-), and 1.0-sec. delay 
after the termination of the stimulus. — 
я addition to the trials with a 
PE aee age pee of 8 (or 10) trials hor 
given with a neutral, middle tone. The istruiti 
Nase “Write all the letters in any order you WIS» 
but do not begin writing until you hear the а 
The tone was sounded with 0.0-зес. delay fol SR 
ing termination of the stimulus and also Mo 
1.0-зес. delay. It bears repeating here HC, ca 
were not permitted just to mark X's but w 
required to guess various letters. | Y 
Results. Controls: The instruction ux 
required Ss to wait for 1.0 sec. before he 
ginning to write their answer was igno Пу 
by the Ss, since it was almost pe 
impossible to begin writing sooner. ;als 
sequently the two different controls—tr 
on which S was required to “wait” for his 
sec. and trials on which 5 could bee ed 
report immediately—are grouped то ег 
These data, which are almost exactly Е. 
Same as the memory span data (Expe n 
ment 1), are presented on the far right 
Fig. 10. «gl 
The Ss' responses on the control Ur 
are analyzed in terms of the correlati 
between the location of letters on of 
stimulus and the accuracy of the ener 
these letters in the response. The ee 
T and B above I-M in Fig. 10 represent 
percentage of the letters of the top an^ ly- 
the bottom rows that Ss report corren 
The middle point is the average percent? z 
of the letters of the top and bottom ES E 
that were correctly reported by SS- -e 
middle point is therefore also the ег 
Percentage correct of all the letters + 
Were reported, Figure 10 shows that all 
report the top row of the stimulus 1107, 
accurately than the bottom row, if they “a 
not instructed with regard to the order 
which they must report the rows. св 
The average accuracy with which ней 
report the top and the bottom rows, W { 
Instructions to report one or the оет 5 
these rows аге given with various dm in 
after the exposure, is also illustrate 


T 
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Fic. 10. Accuracy of the first (second) row reported and of the top (bottom) row as a function of 


the delay of the instruction to report one row first. 


memory (1-М) at right. 


zio. 10. The accuracy of reports of the top 
is DAS slightly as the delay of the 
instructions increases. In other respects, 
however, the data show no systematic 
changes in accuracy with changes in the 
delay of instructions. The data clearly indi- 
cate that the top row is generally reported 
more accurately than the bottom row al- 
though the instruction to report each row 1s 
given with equal frequency. . 

The same data may also be analyzed with 
regard to the accuracy of the row that must 
be reported first and the row that is re- 
ported last. АП Ss except ROR are more 
accurate when they report the first TOW 
(the row called for by the instruction tone) 
than the second row. For most Ss, there- 
fore, the order of report is correlated with 
the accuracy of report. 

There is a slight tendency for the accu- 
racy of report of the row which is reported 
first to decrease as the delay of the instruc- 


Light flash is indicated at lower left; immediate- 


tions increases. On the whole, however, the 
overall accuracy of report decreases slightly 
with the delay of the instruction to report 
one row first. The experimental interfer- 
ence with the normal order of report does 
not change the overal number of letters 
reported correctly by any 5 by more than 
about 0.5 letter. 

In this task, unlike the preceding ones, 
individual differences are more striking 
than the similarities. The pooled data are 
highly untypical of three of the five Ss. 
Figure 11 was devised as a two-dimensional, 
graphical analysis of variance to compress 
the details of Fig. 10 into one figure. 
Each coordinate represents the accuracy of 


1? Figure 11 is based upon a suggestion by E. B. 
Newman. A statistical analysis of variance was 
not attempted since it would have had to be carried 
out separately on each S. There was not enough 
data to make this worthwhile, and Fig. 11 serves 
the same purpose. 
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Fic. И. Graphical analysis of position on 
stimulus vs, order of Teport as contributors to 
Tesponse accuracy. Each point 


represents the 
average of all trials of an S at a particular delay 


of instruction, The order in which the points are 
Connected corresponds to the magnitude of the 


delays. Upper leit: position Preference in control 
(immediate-memory) report, 


one row of the report relative to the whole 
Teport. Thus the ordinate represents the 
number of letters that an individual 5 ге- 
in the top row of the 
ntly of order) divided 
of letters (both rows) 


rrectly, Similarly, the 
Presents the perc, 


ct letters reported 
In his Teport of 


total corre 
Contained 


aph is inacces- 
1f necessary, he is 
Since 


he data is justifiable. 
т it is immediately evident 
for example, 50% of the correct 
letters that JC Teports are from the top 
row and, by implication, 50% are from the 
bottom row of the stimulus. More than 
70% of the correct letters that JC reports 
are in the first row терог i 

represents the Converse, 


preferring t 
port the top row 


O re- 
accurately. 


remaining 


indifferent to whether it is called dur pe 
or last. Other 5з lie between sd e 
tremes, each 5 maintaining. s. eue | 
the same relative accuracy for the ar a 
the first rows throughout the various “a 
conditions. Each 5, therefore, ре р 
within a characteristic, limited arca Pilar 
graph. ND is an exception. At zero C pee 
of the instruction tone, both pem nk 
order account heavily for the correct № MD 
reported by ND. At 0.5-зес. delay Ж 
ignored the order (preferring to alas 
trate on the top row), and at 10-886. е 
she lost her position (top row) Е т 
as well. Tn these three conditions, the | Бу 
number of letters correctly reportec А 
ND remained approximately the sat a 
within 0.5 letter of the control wore 
At LO-sec. delay neither position nor wil 
order contribute to N D's accuracy of i aen 

АП Уз operate in the upper right quac that 
of Fig. 11. This illustrates the finding icu 
no S consistently reported the bottom per 
more accurately than the top, nor ue ee 
row better than the row first called for = 
the instruction tone, Tt does not, of ene 
indicate that Ss could not report the Бшш 
Tow or the last row more accurately Ede 
other conditions, While 5$ normally M 
quite consistently, the data of ND show t vs 
they may try a number of different oe 
cedures. The instructions given the Ss po 
to the exper: 


iment were not restrictive: «ds 
Specific procedure for making a report. dE 
Suggested to the Ss, because the purP з 
of the experiment was to find out BON и 
respond when they are not given deta!* 


: ы oa 5 ctio. 
instructions. With suitable instruct! 10 
RI м sou 

training, and reinforcements, Ss ©? 


E e the 
probably be induced to make most Of 


Possible kinds of reports that can be p^ 
gramed by Fig. 11. This remains ? 
empirical problem, -nent 

he results obtained in this experime 
support the Conclusions that both a posite 
Preference апа the order of report ОТС! 
narily correlate with the accuracy of б 
Sponse, but that probably neither ud 
necessary conditions for response НР 
Some Ss can relinquish the position ше 1 
ence and а favorable order of report This 
по appreciable decrement in accuracy- 


| 
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finding is in opposition to Lawrence and 
Laberge's (1956) contention that accuracy 
is accounted for by the order of report. 
Accuracy and order are often correlated, 
but if a favorable order of report is not 
necessary for accuracy, then it cannot be 
the cause of accuracy. 

When S is given a signal indicating which 
row is to be reported first (Experiment 7). 
the accuracy of report of the row indicated 
by the signal (the first row reported ) may 
be compared to the accuracy of the partial 
report (Experiment 3). The overt pro- 
cedure on each trial is quite similar in 
Experiments 3 and 7. The only difference 
is that in the order of report experiment, 
after the Ss have finished writing the row 
indicated by the signal, they must also write 
down the other row. In the partial report 
procedure they do not have to write the 
second row. The partial report and the 
order of report experiments also share a 
common dependent variable: the accuracy 
of report of the row indicated by the in- 
struction signal. 

In view of the similarity in procedure. 
it is surprising that the accuracy of this 
common datum should be so different in 
the two experiments. For example, when 
Ss give only the partial reports (the in- 
struction signal being given immediately 
after termination of the stimulus), then 
they report 90% of the letters correctly in 
one row of 4/4 stimuli. When they are 
required to write the other row also, then 
they report only 699% of the first four 
letters correctly. Every 5. individually. 
gives а more accurate partial report ( Ex- 
periment 3) than a report of the first row— 
of two rows to be reported (Experiment 7). 
The consistent superiority of the partial re- 
port over the first half of a whole report 
prevails even when the instruction to report 
is delayed for 0.5 (or 0.3) sec. In all cases 
where data are available, cach S reports а 
row of four letters more accurately when 
he does not have to write another row of 
four letters afterwards. That what Ss must 
write later should affect the accuracy of 
what they write first must be explained— 
if we disregard teleological explanation—by 
the effect of prior instructions on the accu- 


racy of the report. In other words, if order 
of report is effective in determining the 
accuracy of report, then this effect must be 
a function of instructions given prior to 
any report at all. For some Ss, no effect of 
order of report upon response accuracy was 
observed. 

The two findings, that partial reports are 
uniformly more accurate than whole reports 
and that order of report may be uncorrelated 
with accuracy, contradict Lawrence and 
Laberge’s conclusion that “partial” reports 
are essentially similar to “first” reports. In 
fact, the second finding (that in some cir- 
cumstances order of report and accuracy 
of report are not correlated) provides a 
direct counterexample to their conclusion. 
Their different results may be in part due 
to the vastly different stimuli which they 
used. Lawrence and  Laberge's entire 
stimuli each contained less information than 
two randomly chosen letters. 


DISCUSSION 

In all seven experiments, Ss were re- 
quired to report the letters of briefly ex- 
posed lettered stimuli. Two kinds of reports 
were explored: partial reports, which re- 
quired the Ss to report only a specified 
part of the stimulus, and whole reports, 
which required the Ss to report ali the 
letters of the stimulus. Experiment 3 
demonstrated that the accuracy of partial 
reports was consistently greater than the 
accuracy of whole reports. Another im- 
portant difference between partial and whole 
reports is the correlation of ассигасу with 
the delay of the instruction to report. This 
was shown in Experiment 4 in which the 
time delay of the instruction signal, which 
indicated the row of the stimulus to be 
reported, was varied. The accuracy of the 
partial report was found to be a sharply 
decreasing function of the time at which 
the instruction was given. Tf the instruc- 
tion signal was delayed for one sec. after 
the exposure, the accuracy of the partial 
report was no longer very different from 
that of the whole report. In Experiment 7 
it was shown that the accuracy of the whole 
report does not change as the time of the 
signal to report is varied—over the same 
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range of time—up to one sec. after the 
exposure. 

The two kinds of report can also be 
considered in terms of the information (in 
these experiments, letters) which they indi- 
cate the 5 has available for report. In the 
whole report, the S reports all the informa- 
tion that he can. When he gives a partial 
report, the 5 may have additional available 
information that is not required for the 
report. A calculation of the information 
available to the Ss for their partial reports 
indicates that between two and three times 
more information is available for partial re- 
ports than for the whole reports. This 
discrepancy between the two kinds of report 
is short-lived. Information in excess of 
that indicated by the whole report was 
available to the Ss for only a fraction of а 
second following the exposure. At the end 
of this time, the accuracy of partial reports 
is no longer very different from that of 
whole reports. 

The whole report has already been ex- 
tensively studied by psychologists. The 
maximum number of items an individual 
can give in such a report 15 called his span 
of unmediate-memory ; whole reports are 
usually called immediate-memory reports. 


Experiments 1 and 2 extend the well-known 
conclusions that the span of immediate- 
memory is an individual characteristic and 
that it is constant over a wide range of 
stimuli and exposure conditions, Although, 
in immediate-memory experiments, items 
are conventionally presented sequentially, 
Experiments 1 and 2 illustrate that this is 


not necessary—that a simultaneous presen- 
tation may also give results characteristic 
of immediate-memory experiments, 

The main problems to be considered here 
concern the partial-whole report discrep- 
ancy: (a) Why is the partial report more 


accurate than the whole report? (b) Why 
does the partial report retain this added 
accuracy only for 


a fraction of a sec 
on 
after the exposure? a 


The answers proposed are а Systematic 
elaboration of an Observation that is readil 
made by most viewers of the actual tache. 
toscopic presentation. They report that 
the stimulus field appears to be still read- 


able at the time a tone is heard which 
follows the termination of the stimulus by 
150 msec. In other words, the subjective 
image or sensation induced by the light 
flash outlasts the physical stimulus at least 
until the tone is heard. The stimulus in- 
formation is thus "stored" for a fraction 
oi a second as a persisting image of the 
objective stimulus. As the visual image 
fades, its legibility (information content) 
decreases, and consequently the accuracy of 
reports based upon it decreases. 

There is other evidence, besides such 
phenomenological accounts, that suggests 
that information is available in the form 
of an image for a short time after extinc- 
tion of the physical stimulus. In the first 
place, it is inconceivable that the observers 
should stop seeing the stimulus at exactly 
the moment the light is turned off. The 
rise and fall of sensation may be rapid, but 
they are not instantaneous. The question 15 
not whether the observer continues to see 
the stimulus after the illumination is turne 
off, but for how long he continues to 56 
the stimulus. A number of different kinds 
of psychophysical measurements of the rise 
and fall of sensation have been attempted: 
These estimates of the persistence of the 
visual sensation vary from a minimum 0 
0.05 sec. (Wundt, 1899) to almost o" 
sec. (McDougall, 1904). The most repre 
Sentative estimates are in the neighborhoo 
of 1/6 sec. (cf. Piéron, 1934), а figure that 
$ in good agreement with the results. 


isi E interactions of the eye rr. 
ann's ingeni ication 
method of Baxt to the ган К persistent! 
is Probably the only experiment that utilizes P? 
tern Stimulation, The other methods have er 
been. tried with Pattern stimuli although there ^ 
а Priori, no good reason why they have not beri 
The possibility that the persistence of palit 
information is quite different from persistence 
brightness" has not been investigated. 
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In Experiment 5 it was shown that the 
post-exposure field strongly influences the 
accuracy of both the partial and the whole 
report. This experiment indicates that the 
available information is sensitive to inter- 
ference by noninformational visual stimuli 
which follow the exposure. The dependence 
of available information upon noninforma- 
tional visual stimulation is just the depend- 
ence that would be expected of a visual 
image. | 

Finally, there are subtle aspects ої the 
sequence of letters reported by an 5 which 
characterize the information that is avail- 
able to him. In sequentially spoken letters, 
for example, there is а limit—two—on the 
number of letters that can be adjacent to 
any given letter. Different limits apply to 
a two-dimensional visual display. If in- 
formation is stored in a form topologically 
similar to the stimulus, this may be detected 
by noting the sequential dependencies that 
limit successive responses to the stimulus. 

Probably the kinds of sequential respond- 
ing that would most clearly distinguish 
visual from auditory information storage 
would be (a) the ability of the S to read 
the rows of the visual stimulus backwards 
as well as forwards, or to report the columns 
or the diagonals, and (b) his inability to 
do an equivalent task when presented with 
the information sequentially. (All these 
procedures merely require the report of 
adjacent letters if the stimulus is two 
dimensional. Unfortunately, these particu- 
lar experiments were not conducted. 

The foregoing experiments offer some 
relevant evidence. In contrast to spoken 
letters and numbers, which are most accu- 
rately recalled if they occur at the beginning 
or end of a sequence (Pollack, 1952) ?* 
no obvious gradients of accuracy were 
found in the foregoing experiments. The 
middle row actually tended to be slightly 
better reported than the other rows. There- 
fore, it is unlikely that the entire visual 
stimulus (12-letters and numbers) was 
transformed into an auditory (sequential) 
representation for storage. Such an entire 
transformation is also unlikely, though not 


x т 
14 Summarized іп Luce (1956). 


impossible, because of the relatively small 
time between the stimulus exposure and the 
report. 

An analysis of errors reveals numerous 
cases of errors that may be classified as 
“misreading” (for example, confusions be- 
tween E and F, B and R) and as “mis- 
hearing” (for example, confusions between 
B and D, D and T—Miller & Nicely, 1955). 
Still other confusions (for example, C 
and G) are ambiguous. All of these types 
of errors occurred whenever errors oc- 
curred at all. The ubiquity of misreading 
and mishearing errors, taken at face value, 
suggests that both visual and auditory 
storage of information are always involved 
in both whole and partial reports. А non- 
quantitative error analysis is therefore not 
likely to shed much light on the question of 
visual imagery. The frequent mishearing 
errors suggest that the storage of letters, 
just prior to a written report, may share 
some of the characteristics of audition. Like 
the preceding analysis of the constraints 
upon successive responses, error analysis 
requires considerable research before it can 
be quantitatively applied to problems of 
imagery. 

This then is the evidence—phenomeno- 
logical reports, the effects of the post- 
exposure fields, the known facts of the 
persistence of sensation, and the detailed 
characteristics of the responses—that is 
consistent with the hypothesis that informa- 
tion is initially stored as a visual image and 
that the Ss can effectively utilize this in- 
formation in their partial reports. In the 
present context, the term, visual image, is 
taken to mean that (a) the observer behaves 
as though the physical stimulus were still 
present when it is not (that is, after it has 
been removed) and that (b) his behavior 
in the absence of the stimulus remains a 
function of the same variables of visual 
stimulation as it is in its presence. The 
units of a visual image so defined are al- 
ways those of an equivalent "objective 
image," the physical stimulus. It is as logi- 
cal or illogical to compute the information 
contained in a visual image (as was done in 
Experiments 3 and 4) as it is to compute 
the information in a visual stimulus. 
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"Visual image" and "persistence of sen- 
sation" are terms suggested by the asyn- 
chrony between the time during which a 
stimulus is present and the time during 
which the observer behaves as though it 
were present. Although asynchrony is in- 
evitable for short exposure durations, there 
is, of course, no need to use the term "visual 
image" in a description of this situation. 
One might, for example, refer simply to 
an "information storage" with the charac- 
teristics that were experimentally observed. 
This form of psychological isolationism 
does injustice to the vast amount of relevant 
researches. 

Imagery that reputedly occurs long after 
the original stimulation (memory images, 
eidetic images, etc.) is of interest as well 
as imagery that occurs for only a few tenths 
of a second following stimulation. Whether 
the term "imagery," as it has been used 
here to describe the immediate effects of 
brief stimulation, is an appropriate term for 
the description of the lasting effects of 
stimulation is an empirical problem. It js 
hoped, however, that the principles and 
methods developed here will not be without 
relevance to these traditional problems. 


Persistence of Vision and Afterimages 

Between the short 
and the remembranc 
event, there is an 
the afterimage, 
tion. 


Persistence of vision 


intermediate situation, 


n which requires considera- 
In discussin ‚ it will be 


intense or pro- 
Te is the “initial” 
"image" (or sen- 


ог perception, or 
combination of a term fro 


m 
the first and from the second of these 
groups may be used, The initial ima is 
followed by 


a latent period du 
nothing is seen and which May in turn he 
followed by a complex Sequence of after- 
images. Afterimages may be either positive 
or negative; Almost апу sequence is possi- 
ble, but the initial image is almost always 


ring which 


positive)? Some authors distinguish We 
initial image from a positive afterimage 
(for example, McDougall, 1904); pas 
do not (for example, von Helmholtz, 1924- 
25). It is often implicit in such distinctions 
that the persistence of the initial image ЗЯ 
due to a continued excitatory process, 
whereas afterimages arise from receptor 
fatigue. If there is no repeated waxing a 
waning of sensation, but merely a singe 
rise and fall, one cannot distinguish SANE 
phases in the primary А d 
responding to the "initial image anc tî 
other to an identical "positive afterimage 
of it. - 
Although it is difficult to prove beam 
visual information is stored in the — 
image, there can be no gainsaying that Т6 
afterimage тау be a rich store of infor € А 
tion. Positive or negative afterimages o 
carry many fine details, including detar $ 
that were not visible at the time of stimula- 
tion (von Helmholtz, 1924-25). Afte! 
images generally last for at least severa 
seconds, and following high energy stimu isd 
tion they normally last for several minute 
(Berry & Imus, 1935). The clarity of th 
details, of course, deteriorates with the ра 
sage of time. Since afterimages арреат fn 
move when the eye is moved, they usua 
have been considered retinal phenomen, 
Taken together, these facts imply that mar 
is a considerable capacity for visual ra 
formation storage in the retina. 


to 


/ eae А sed in 
illumination of the stimulus cards à 
^ fon 1 1 } 

the foregoing experiments had been St 


пе 
blaze the letters upon im 


take maximum аймай ое 
1 Storage capacity, Pur 
little doubt afterwards 
location of most of | 
mation, The stimulus prese 
actually used, however, rarely elicit 
reports of afterimages: c. usually тербе, 
“ети simply а single brief flash. p 
problem is therefore to determine the ре 


ciently intense to 
retina and thereby 
of its informatior 
would have been 
to the nature or 
available infor 
tations 


as 
1€ 
n^ 


e 


Bidwell (1 
conditions 
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rib 
897) and Sperling (1960) dese ge 
| ; seeing a negative "aero in 
Without prior Positive image," The metho of 


volves и Presentation quite similar to tha 
Baxt (1871). 


for 
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sistence of the image of the brief flash, or 
equivalently, the duration of seeing (the 
stimulus) or the persistence of vision (of 
the stimulus), rather than the duration of 
an afterimage. These terms are used to 
suggest that the S feels he is responding 
directly to the stimulus rather than to after- 
effects of stimulation. 

Psychologists have often carelessly as- 
sumed that the absence of discernible after- 
images following a visual presentation was 
sufficient to insure that the duration of sen- 
sation will correspond to the duration of the 
Physical stimulus, that is, that there is no 
Persistence of vision at all. Wundt (1899) 
was one of the first to take vigorous excep- 
tion to this naive view. Wundt's most com- 
pelling example was drawn from Weyer 
(1899). Weyer had found that two 40- 
microsecond light flashes had to be sepa- 
rated by 40 to 50 msec. in order for them 
to be seen as two distinct flashes; at smaller 
separations they were seen as a single flash. 
In the dark adapted eye, the minimum 
separable interval that consistently yielded 
reports of "two flashes" was 80 to 100 
msec. 

Wundt argued that the two flashes could 
not be seen as distinct until the sensation 
occasioned by the first flash had ceased. 
Thus, under optimum conditions, the mini- 
mum duration of the sensation of a short 
flash was at least 40 msec. which, in this 
case. was 1,000 times the stimulus duration. 
Wandt thought that, in order to determine 
tue duration of a longer flash, one must 
merely add the 40 msec. of fade-out time to 
the actual physical light duration. While 
these details of Wundt's reasoning may be 
questioned, his main point, based on "e 
example of the short flash, is indisputable : 
one does not directly control the time for 
which information is visually available 
simply by manipulating exposure duration. 
The experiments reported here provide a 
direct proof of this assertion. 


An Application of the Results to 
“Refore and After” Experiments 


The previous experiments showed that 
more information is available to Ss for a 


few tenths of a second after the exposure 
than they can give in a complete report of 
what they have seen. It was suggested that 
the limit on the number of items in the 
memory report is a very general one, the 
span of immediate-memory, which is rela- 
tively independent of the nature of the 
stimulus. Evidence was offered that infor- 
mation in excess of the immediate-memory 
span is available to the S as a rapidly fading 
visual image of the stimulus. If more in- 
formation is available to him than he can 
remember, the S must “choose” a part of 
it to remember. In doing so, he has chosen 
the part to forget. In Experiments 3 and 4, 
Ss exercised only locational choices, that is, 
portions of the stimulus were remembered 
only on the basis of their location. Loca- 
tional choices are probably not the only 
effective choices that the S сап make. Dur- 
ing the short time that information is avail- 
able to him, the S may process it in any 
way in which he normally handles informa- 
tion. Usually, what he does, or attempts 
to do, is determined by the instructions. 
The S’s (unobservable) response to the 
stimulus is probably the same whether the 
instruction to make this response is given 
before the stimulus presentation or after 
it; the difference between the two cases lies 
in the information that the response can 
draw upon. If the stimulus contains more 
information than the S's immediate-memory 
span, and if the post-exposure instruction is 
delayed until the S has little of this extra 
information available, then а difference in 
the accuracy of the responses with prior- 
and post-exposure instructions will be ob- 
served. If the stimulus does not contain 
more information than can be coded for 
immediate-memory, or if the post-exposure 
instruction is given soon enough so that 
the S can utilize the still available informa- 
tion effectively, then only minor differences 
in the accuracy of responses with prior- and 
post-exposure instructions will be observed. 
If the stimulus is destitute of information 
(for example, а single, mutilated, dimly 
illuminated letter of ihe alphabet) then a 
host of other factors which are normally 
insignificant may become crucial. In this 


case, the "stimulus" itself may well be 


24 GEORGE SPERLING 


irrelevant (Goldiamond, 1957), and the 
effects of instructions given before or after 
the exposure must be predicted on some 
other basis. 


There are some simple experiments in 
which it is known a priori that the effects 
of instructions given either before or after 
the exposure will be exactly the same. This 
degenerate situation can be illustrated by a 
stimulus which is exposed for one micro- 
second and with sufficient energy to be 
clearly visible. By suitable coding, the pre- 
exposure and post-exposure instructions 
can be separated by only two microseconds. 
The example serves to emphasize that what 
is implicitly referred to by “before and 
after” is not the exposure but something 
else: traditionally, the sensitization and/or 
forgetting that presumably occur in con- 
junction with the exposure. Thus, the 
theory that has been presented here merely 


gives an explicit statement of assumptions 
that have long been implicit. 


Unobservable Responses and 
the Order of Report 


The subjective response to the high signal 
tone is "looking up." Since eye movements 
Cannot occur in time to change the retinal 
image with any of the presentations used 
(Diefendorf & Dodge, 1908) a successful 
looking-up must be described in terms of а 


shift in "attention." Nonetheless, Such a 
Shift in attention 


Studied by means 


short time difference, 0.1 sec., accounts for 
the similar accuracy of responding in these 
two conditions. 

Once his attention is directed to the ap- 
propriate row, the 5 still has to read the 
letters. This, too, takes time. Baxt's (1871) 
data indicate that the time required to read 
a letter is about 10 msec. Baxt's experiment, 
with some modifications, was repeated by the 
author, and similar results were obtained.’ 
~ How is all this relevant to the order of 
report? The order in which the letters are 
finally reported can be an important variable 
because of (a) purely temporal factors 
(letters that are reported first will be more 
accurately reported only because they are 
reported sooner aíter the exposure. the 
actual order of reporting the letters per $ 
being relatively unimportant) or (b) ies 
action effects (the report of some letters 1 
detrimental to the report of the remaining 
letters, that is, letters reported later su к 
from proactive interference by the letter 
reported earlier). 

That purely temporal factors cannot Þe 
very important can be seen from the slop? 
of the curves describing available inform" 
tion as a function of time. In the foregoing 
experiments, the amount of available 12" 
formation approached the immediate 
memory span at about 0.5 to 1.0 sec. altel 
the exposure; further decrements in ava! 
able information as a function of time a 
slight. The report of the letters usually 
does not begin until 1.0-1.5 sec. after pe. 
exposure, “The passage of time during ш 
actual time that letters are being reporter 
therefore, cannot account for appreciab E 
accuracy changes as a function of the 010 
of report, í 

The second possible effect of order a 
Teport—the interfering effect of the letter 
reported first upon unreported letters 
cannot be so readily discounted. Proactiv” 
interference would imply that partial r 
ports are more accurate than whole геро! a 
by an amount dependent upon their relat! d 
lengths. The results of Experiment 4 tend 
to support this view. At delays of the ' 
—— 

16 Sperling, 


С. Unpublished iments 229, 
ducted at the Be Mien eic one 


Bell Telephone Laboratories, 
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struction signal greater than 0.3 sec., partial 
reports of three letters (from stimuli of 
nine letters) indicate more available letters 
than do partial reports of four letters (from 
stimuli of twelve letters). On the other 
hand, in Experiment 7 one 5, КОК, does 
not show decreased accuracy as a function 
of the length of his report. ROR is able to 
report eight letters as accurately when he 
begins his report with three or four incor- 
rect letters as when he ends his report with 
three or four incorrect letters. Other 5s did 
not systematically attempt to report incor- 
rect letters first. Had they been required to 
report incorrect letters first, they might well 
have been able to do so. 

The choice of what part of the stimulus 
to attend to or of which letters to read is 
the choice of what fraction of the stimulus 
information to utilize. This choice can be 
made successfully only while the informa- 
tion is still available. The Ss prefer to 
report what they remember first, but this 
does not imply that they remember it be- 
cause they report it first. It is difficult to 
disentangle the many factors that determine 
precisely what stimulus letters will appear 
in the response, but important choices of 
what information is to be recalled must 
occur while there is still something to 
choose from. Since the actual report begins 
only when there are available but a few 
letters in excess of the immediate-memory 
span, the order of report can at most play 
only the minor role of determining which 
of the few "excess" letters will be for- 
gotten." 


The Questions of Generality 
and Repeatability 


To what extent are the results obtained 
limited to the particular conditions of the 
foregoing experiments? The possibility that 
the actual physical fading of the light 
source is important to the availability of 


— 

17 There are many ways in which proactive 
interference might occur. For example, if letters 
are stored "sequentially" prior to report (cf. 
Broadbent, 1957b), then the importance of order 
of report may lie in the agreement of the two 
sequences : storage and report. 


information can be rejected not only on 
prior grounds (see Apparatus) but also by 
the empirical findings. For example, in 
Fig. 9, the curve representing the number 
of letters available 0.15 sec. after exposure 
is quite similar to the 0.0-sec. delay curve. 
There is no visible energy emitted by the 
light source 0.15 sec. after its termination. 

In the present case, the answer to the 
problem of repeatability of the results is 
made less speculative by three separate 
investigations that have since been con- 
ducted with similar techniques to those re- 
ported here. 

The experiments have been repeated by 
the author?? with a different tachistoscope, 
timer, and a light source that has only a 
negligible afterglow. АП the main findings 
were reproducible. 

Klemmer and Loftus (1958) confirmed 
the existence of a short-term, high informa- 
tion storage. They used a display consisting 
of four discrete line patterns, the S being 
required to report only one of these. The 
instruction was coded either as a signal 
light or verbally. Decay curves obtained 
when the instruction is delayed are similar 
to those reported above. A similar experi- 
ment has also been conducted by Aver- 
bach,? who used a television tachistoscope 
to present stimuli containing up to 16 letters. 
A pointer appeared above the letter to be 
reported. Initially, Ss had about twelve 
letters available for report, but the number 
decreased rapidly when the visual instruc- 
tion was delayed. 

It is usually technically more difficult to 
code instructions visually than acoustically. 
Although the principle of sampling in order 
to determine available information is com- 
mon to both kinds of instructions, visually 
coded instructions differ in some interesting 
ways from acoustically coded instructions. 
For example, the time taken to "interpret" 
—or even to find—a visual instruction may 
well depend on its location relative to the 
fixation point. Moreover, there may be 


18 Sperling, G. Unpublished experiments 
› У $ con- 
ducted at the Bell Telephone Laboratories, 1958, 


19 Averbach, E. Unpublished experiment: = 
ducted at the Bell Telephone Laboratories, 1959. 
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spatial interactions between the visual "in- 
struction" and the symbols to be reported. 
On the other hand, prior to training, the 
task oi interpreting a visual marker is 
easier for Ss than the equivalent task with 
an acoustically coded instruction. Ulti- 
mately, such differences are probably only 
of secondary importance since the two 
kinds oi experiments agree quite well. 
Three main findings emerge from the ex- 
periments reported here: a large amount of 
information becomes available to observers 
of a brief visual presentation, this informa- 
tion decays rapidly, the final level is ap- 
proximately equal to the span of immediate- 
memory. Although the exact, quantitative 
aspects of information that becomes avail- 
able following a brief exposure unquestion- 
ably depend upon the precise conditions of 
presentation, it seems fair to conclude that 
the main results can be duplicated even 


under vastly different circumstances т 
different laboratories, 


SUMMARY AND CONCLUSIONS 


When stimuli consisting of a number of 
items are shown briefly to an observer, 
only a limited number of the items can be 
correctly reported, This number defines 
the so-called “span of immediate-memory,” 


suggests that 

Process that is 
information. In 
sampling procedur 
used to circumyen 
by immediate-mem 
that at the time 

few tenths of a sec 
have two or three 
tion available 
availability 
rapidly, and wi 
exposure 


the present studies, a 
€ (partial Teport) was 
t the limitation imposed 
оту and thereby to show 
ot exposure, and for a 


ond thereafter, observers 
times 


that БепегаПу fo 
ulation. In thi 


Sistence is that of a 


image of the stimulus. Evidence in support 
of this hypothesis of visual information 
storage was found in introspective accounts, 
in the type of dependence of the accuracy 
of partial reports upon the visual stimula- 
tion, and in an analysis of certain response 
characteristics. These and related problems 
were explored in a series of seven experi- 
ments. 

An attempt was first made to show that 
the span of immediate-memory renam 
relatively invariant under a wide range о 
conditions. Five practiced observers were 
shown stimuli consisting of arrays of synt 
bols that varied in number, arrangement, 
and composition (letters alone, or letters 
and numbers together). It was found (a= 
periments 1 and 2) that each observer W ^t 
able to report only a limited number : 
symbols (for example, letters) goren! 
Гог exposure durations from 15 to b. T 
msec., the average was slightly over four 
letters; stimuli having four or fewer letter? 
Were reported correctly nearly 10096 of the 
time. : 

In order to circumvent the immediate 
memory limit on the (whole) report о 
what has been Seen, observers were require : 
to report only a part—designated by ips 
tion—of stimuli exposed for 50 ms a 
(partial report), The part to be reporte 
usually one out of three rows of letter? 


=) to 
was small enough (three to four letters) t 
lie within the 


(high, middle, 
to indicate wh 
reported. The 


€ indicator signal was no. 
r the visual stimulus par 
n this manner, the in reli 
o the 5 was sampled im 
after termination of the stimulus 
‚ for each material teste 
12 symbols), gave partial report? 
that were more accurate than whole repo! ж 
for the same material. For example, rollo", 
ing the exposure of stimuli consisting of 
symbols, 76% of the letters called for in t и 
Partial report were given correctly by Е 
observers, This accuracy indicates that (7, 
total information available from which [^ 
Observer can draw his partial report 
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about 9.1 letters (76% of 12 letters). This 
number of randomly chosen letters is 
equivalent ro 40.6 bits of information, which 
is considerably more information than pre- 
vious experimental estimates have suggested 
can become available in a brief exposure. 
Furthermore, it seems probable that the 
40-bit information capacity observed in 
these experiments was limited by the small 
amount of information in the stimuli rather 
than by a capacity of the observers. 

In order to determine how the available 
information decreases with time, the instruc- 
tion signal, which indicated the row of the 
Stimulus to be reported, was delayed by 
Various amounts, up to 1.0 sec. (Experi- 
ment 4). The accuracy of the partial report 
Was shown to be a sharply decreasing func- 
tion of the delay in the instruction signal. 
Since, at a delay of 1.0 sec., the accuracy 
of the partial reports approached that of 
the whole reports, it follows that the in- 
formation in excess of the immediate- 
memory span is available for less than a 
second. In contrast to the partial report, the 
accuracy of the whole report is not a func- 
tion of the time at which the signal to 
report is given (Experiment 7). | 

'The large amount of informatior in ex- 
cess of the immediate-memory span, and 
the short time during which this informa- 
tion is available, suggests that it may be 
stored as a persistence of the sensation re- 
sulting from the visual stimulus. In order 
to explore further this possibility of visual 
information storage, some parameters of 
visual stimulation were studied. A decrease 
of the exposure duration from 50 to 15 
msec. did not substantially affect the accu- 
тасу of partial reports ( Experiment 5). On 
the other hand, the substitution of a white 
post-exposure field for the dark field 
ordinarily used greatly reduced the accu- 
racy of both partial and whole reports. The 
ability of a homogeneous visual stimulus 
to affect the available information is evi- 
dence that the process depends on а per- 
sisting visual image of the stimulus. 

Whether other kinds of partial reports 
give similar estimates of the amount of 
available information was examined by ask- 
ing observers to report by category rather 


than by location. The observer reported 
numbers only (or the letters only) from 
stimuli consisting of both letters and num- 
bers (Experiment 6). These partial reports 
were no more accurate than (whole) re- 
ports of all the letters and numbers. The 
ability of observers to give highly accurate 
partial reports of letters designated by loca- 
tion (Experiment 3), and their inability to 
give partial reports of comparable accuracy 
when the symbols to be reported are desig- 
nated as either letters or numbers, clearly 
indicates that all kinds of partial reports 
are not equally suitable for demonstrating 
the ability of observers to retain large 
amounts of information for short time 
periods. 

In the final study (Experiment 7), the 
order of report was systematically varied. 
Observers were instructed to get as many 
letters correct as possible, but the order in 
which they were to report the letters was 
not indicated until after the exposure. An 
instruction tone, following the exposure, 
indicated which of the two rows of letters 
on the stimulus was to be reported first. 
This interference with the normal order of 
report reduced only slightly the total num- 
ber of letters that were reported correctly. 
As might be expected, the first row—the 
row indicated by the instruction tone—was 
reported more accurately than the second 
row (order effect). There was, however, 
a strong tendency for the top row to be 
reported more accurately than the bottom 
row (position effect). Although, as a group, 
the observers showed both effects, some 
failed to show either the order or the posi- 
tion effect, or both. The fact that, for some 
observers, order and position are not cor- 
related with response accuracy suggests that 
order of report, and position, are not the 
major causes of, nor the necessary con- 
ditions for, response accuracy. The high 
accuracy of partial report observed in the 
experiments does not depend on the order 
of report or on the position of letters on 
the stimulus, but rather it is shown to 
depend on the ability of the observer to read 
a visual image that persists for a fraction 
of a second after the stimulus has been 
turned off. 
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| 
ip recent years, a. number of important 
\ explorations of the use of projective 
| measures in the survey interview setting 
(Campbell, 1950; Douvan & Walker, 1956; 
| Lansing & Heyns, 1959; Maccoby & Mac- 
| coby, 1954; Sanford, 1950; Sanford & 
Rosenstock, 1952; Scott, 1956; "Tompkins 
& Miner, 1958) have foreshadowed an in- 
direction. of future research in 


| evitable 
social psychology: the common use in sur- 
| vey studies of assessment techniques devel- 
oped in experimental and clinical studies of 
personality dynamics. This trend can be 
expected to promote a more meaningful 
integration than now exists of the societal 
. data continually amassed in surveys of 
' broad populations and the research findings 
accumulated in the clinics and laboratories 
| of personality research. How quickly this 
1 integration will come about will depend in 
large measure upon the conscientiousness 
with which certain serious methodological 
broblems are faced by the investigator who 
eaps from the laboratory or clinic to the 
loorstep of a startled housewife, inter- 
viewed because she happened to be selected 
oy the magic of probability sampling. Con- 
ceptual integration of facts from field study, 
“clinic, and experiment will be hastened if 
the personality assessment devices selected 
. from the laboratory have already been in- 
vestigated with some thoroughness. 
The present investigation is an attempt to 
foster such an integration. We present here 


1 Тһе investigations reported іп this paper аге 
based on data obtained within a national sample 
survey supported by the Joint Commission on 
Mental Illness and Health (Gurin, Project Direc- 
Analysis of these data was supported by two 


tor). С 
mi from the National Institute of Mental 
Health, United States Public Health Service, 
Projects M2181 (Veroff, Principal Investigator) 


and M2280 (Gurin, Principal Investigator). 


Survey Research Center, University of Michigan 


an examination of the methodological prob- 
lems faced in the first attempt to introduce 
thematic apperceptive measures of three 
motives (n Achievement, n Affiliation, and 
n Power) into a nationwide sample survey. 
Considerable research has been conducted 
to develop these measures in the experi- 
mental setting (Atkinson, 1958; McClel- 
land, Atkinson, Clark, & Lowell, 1953). 
Consequently, in adapting these measures to 
the survey setting, we are in a position to 
be guided by a fund of research findings 
which define their validity and highlight the 
methodological issues to be given serious 
consideration. 

A decade of experimental work has 
shown that experimentally induced motiva- 
tional states influence the content of imagi- 
native thought in ways that can be reliably 
coded to yield measures of important social 
motives—n Achievement, n Affiliation, and 
n Power. The use of these methods of con- 
tent analysis in studies to assess individual 
differences in the strengths of these motives 
has produced an accumulation of factual 
information concerning the nature of moti- 
vational influences on behavior—at least 
within the population of high school and 
college students who have been the chief 
subjects of study. While results on hand 
attest to the validity of the approach, con- 
siderable further experimental refinement 
of the present measures is, without ques- 
tion, required. Nevertheless, several ex- 
plorations using these measures in different 
cultural settings, i.c., in situations outside of 
the college laboratory (Child, Storm, & 
Veroff, 1958; Douvan, 1958; McClelland & 
Friedman, 1952; Rosen, 1956), have already 
suggested their promise for field research 
particularly for survey studies. We realize, 
however, that a number of weighty metho- 
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dological barriers stand in the way of using 
thematic apperceptive measures of motiva- 
tion in a national survey, and that these 
barriers will only be overcome through 
direct study in field research. 

The questions of immediate interest in 
this paper, therefore, have to do with 
methodological problems in using the the- 
matic apperceptive method—known to re- 
quire a modicum of verbal ability and 
known to be sensitive to the situational dif- 
ferences at the time of administration and 
known to require skill in content analysis— 
under the conditions of national survey in- 
terviewing. These questions are listed 
below. 

Can a single set of pictures be selected 
that are sufficiently relevant to the life ex- 
periences of people in all Strata of society 
to provide fair and unbiased measures of 
motivation? Can the conditions of adminis- 


be sufficiently c 


standardized of experimental 


of the content of 

of imaginative 
Stories be accomplished with the same high 
Scoring reliability that has been attained in 
S of more homogeneous 
Can anticipated differences in 
in a national sample be taken 


Our initial 
questions cannot be taken 
sider this paper as no mor 
wedge, explicitly pointed 
groundwork for further 
sights derived from clini 
and field studies of hum 


answers to these 
as final. We con- 
€ than an opening 
to laying а firmer 
integration of in- 
cal, experimenta], 
ап motivation. 


METHODOLOGICAL PROBLEMS 


Selection of Pictures 


The particular pictures used to assess шой 
tivation by thematic apperception have im- 
portant effects on the motivational content 
of stories (Birney, 1958; Haber & Alpert, 
1958; Jacobs, 1958). Pictures will differ, 
depending on their content, in the average 
amount of motivational imagery they elicit 
from any group of subjects (Ss). The 
amount of motive-related imagery that E 
picture elicits from a person pesi 
both by the strength of his motive and by 
his past experiences in settings like the one 
portrayed in the picture. Variability in 
over-all motivation scores which can be at 
tributed to peculiar characteristics of per 
Pictures used rather than to the strength о 
motive within the Ss is a source of error 
we seek to avoid. , 

The population in the survey study to be | 
reported in this paper is a sample of 1,61 
adults (21 or over), a cross-section ОЎ 
Americans living in private households, s€ | 
lected by means of probability ѕашріша 
(Kish, 1953). То construct а battery O 
Pictures that will have Similar significance 
for all social groups represented in this 
sample was the major problem of picture 
selection that we faced. Men and women, 
old people and young people, blue collar | 
Workers and white collar workers, all have | 

а4 ехрегїепсе їп different sets of life situa- 
tions, often so different that the motives 
tional significance of Particular kinds O 
Pictures js likely to vary consi derably from | 
one of these groups to another. Indeed, the 
sex difference is so important a factor 
(Angelina, 1955. Davenport, 1953; Field, 
19 1; Morrison, 1954; Veroff, Wilcox, & 
Atkinson, 1953; Vogel, 1954) that it was | 
necessary from the very beginning to pla? 

a different Set of pictures for men айы 

Women. We hoped, in addition, to ove 
come other systematic biases in picture | 
eae M 

? Atkinson 
Oretical schem 
ased on past 
Portrayed in a 
Motives of th 
motivational с 


D 
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that might be attributable to many other 

differences in social background, in order to 

allow each person a fair opportunity to ex- 

press his motivation in the test as а whole. 
. Only then could the motivation scores of 
different social groups be legitimately com- 
pared; this aim was a compelling interest in 
the initiation of our study. 

How then did we go about selecting pic- 
tures for men and women separately? The 
list of pictures employed to study n 
Achievement, n Affiliation, and n Power 
(Atkinson, 1958, Appendix ПТ) attests to 
the wealth of data about many pictures 
from assessment studies of this kind, largely 
on the college population. From this list we 
rejected pictures that were clearly oriented 
to the college setting (eg. classroom 
Scenes). Since we were interested in select- 
ing pictures that would be relevant to three 
motivational contents (achievement, affilia- 
tion, power) at one time, we also rejected 
Pictures suggesting themes that contained 
only one particular kind of motivational 
content. Initially our choices were guided 
only by intuition. 

Before an actual pretest of some pictures 
in a doorstep interview, however, a more 
Systematic evaluation of pictures was under- 
taken. It has been shown that ratings of 
pictures Рог strength of achievement- 
relatedness are correlated with the n 
Achievement scores later obtained from 
stories written about a picture by other 
groups of Ss (Birney, 1958; Haber & Al- 
pert, 1958; Jacobs, 1958). Consequently 
we employed a rating procedure to ascertain 
empirically the cue value of many pictures. 
A group of college students at the Univer- 
sity of Michigan rated 40 pictures for the 
salience of achievement, affiliation, power, 
and seven other motivational concerns. At 
least eight Ss rated each picture. From 
these ratings we tried to select pictures in 
which one kind of motivation was rather 
strongly suggested but at least one or the 
other kinds of motivation was weakly 
suggested. 

Following this preliminary selection of 
pictures, We faced more directly the po- 
tential bias of picture cues for one social 
group compared to another. We hoped to 


eliminate bias in the instrument attributable 
to pictures that allow for adequate expres- 
sion of motivation by only certain groups 
of people. Our decision, therefore, was to 
select pictures portraying separately for 
men and women a variety of life situations 
with which, in one way or another, most 
people in this country have had some direct 
contact, viz., different types of work situa- 
tions and interpersonal situations. Sampling 
from this variety of what appeared to us to 
be fairly universally-relevant situations 
should minimize the bias in the measuring 
instrument. 

Combining the ratings given by the stu- 
dents and our a priori conception of diverse 
situations for sampling pictures, we selected 
a large group of pictures for pretest in a 
Detroit residence survey. The stories told 
to these pictures were then scored for n 
Achievement, n Affiliation, and n Power. 
The pictures selected from this group for 
the final test forms, one for the men and 
one for the women, were those producing 
an adequate balance of scores for the three 
motives. The pictures finally decided upon 
are listed in order of presentation? (num- 
bers refer to the list in Appendix ПІ, At- 
kinson, 1958) : 

Male Form: 

1. Two men (inventors) in a shop working at a 
machine. (2) 

2. Four men seated at a table with coffee cups. 
One man is writing on a sheaf of papers. (101) 

3. Man (father) and children seated at breakfast 
table. (102) 

4. Man seated at drafting board. (28) 

5. Conference group. Seven men variously 
grouped around a conference table. (83) 


6. Woman in foreground with man standing be- 
hind and to the left. (103) 


Female Form: 


l. Two women standing by a table and one 
woman is working with test tubes. 

2. Woman (mother) seated by girl reclining in 
chair. 

3. Group of four women. 


Е One standing, the 
others seated facing each other. 


3 These forms are available (4" X 6" pictures). 
They may be obtained by writing to the Survey 
Research Center, Ann Arbor, Michigan. Price 
$1.00. 
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4. Woman kneeling and applying a cover to a 
chair. 


3. Two women preparing food in the kitchen. 
6. Same as 6 above. 


A striking feature of these pictures is 
that they all provide only moderate struc- 
ture to the situation portrayed. The set as 
a whole is much less strongly cued for one 
particular kind of motivation than sets of 
pictures used in experimental studies which 
have focused on only one kind of motiva- 
tion. Rejected pictures were ones that were 
considered either under- or overstructured. 
The understructured pictures (e.g., a person 
standing alone in a doorway) did not pro- 
vide sufficient cues for a story in a hetero- 
geneous population. The overstructured pic- 
tures (e.g., a woman typing in an office) 
elicited stories with too little variability of 
response. 

These two forms were administered to a 
group of 98 college students at the Univer- 
Sity of Michigan. The measures were ad- 
ministered in a latin Square design control- 
ling the order of presentation of pictures. 
Unlike the survey interview setting, the 
administration called for written Stories 
according to procedures outlined elsewhere 

(Atkinson, 1958, Appendix III). We at- 
tempted to ascertain in these administra- 
tions: (а) whether possible influences on 
the total Motivation scores of the order of 
Presentation of pictures existed, and (5) 
hair oy 
Power found in үа 1 ш, + 

der procedures » ally pens E 
Eus A normally employed on a 
Sample of college students. This study 


showed that order of Presentation of pic- 
tures did not have any 


total motive scores and 
with which achievem 
power imagery appe 
form were approxin 
lege group. 


study of a national sample, 

We realize, however, that the 
finally selected undoubtedly stil] have dif- 
ferential cue values for Some respondents 
Negroes, for example, have to respond io 


pictures 


situations depicting white people. Very old 
and very young people have to make up 
stories about figures that are obviously not 
so old nor so young as they are. The work 
scenes do not include farming, surgery. 
ditch digging, sales, and countless per 
occupations, although both blue collar v 
white collar work situations are included. 
Despite these differences, our working 95 
sumptions are that the situations ew m 
in the pictures finally selected are not а ith 
gether foreign to most of the people w! 
whom we are dealing, and that, by e 
large, the pictures will be as suggestive А 
the three motivational content areas bs. 
person of one social stratum as to 2 
in another. We had finally to recognize iya- 
only through intensive study of the mot! if- 
tional cue value of these pictures for out 
ferent social groups would we sharpen о 
insight and discover criteria that M87 . 
enable us to construct a better set m 
evaluate the over-all merits of the pres 

set of pictures. In the second section е 
this monograph, is ra! 


a ? . motive 
agam. In that section we present mo 


subgroups can be. ires 
Uted to biased selection of piet ect 
Whether these differences in fact 16 С" 
something more than biases in picture “at 
tion is a problem we consider т 
section. «сга 
The procedures followed in adminis! т 
tion of the pictures and in the analys! ub- 
motivational content will be described salts 
quently: Table 1 presents some "nent 
rom the nation ture? 4 
ye 
1 shows the frequency of imagina 
stories told in ا‎ 
which contained ч 


each kind of motiva 
content, 


ге 
Both male and female forms appear "m 
Strongly cued for affiliative imagery опт 
for achievement or power imagery. пай" 
clude, however, that the greater а ү, na 
response is indicative of some trend i! at 
tional character would require the 5 pic 
tous assumption that each set of Qu of 
tures represents equally the populati 
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| TABLE 1 


| 
| PERCENTAGE ОЕ STORIES CODED FOR ACHIEVEMENT, 
AFFILIATION, AND POWER IMAGERY 
(By PICTURE AND SEx) 


Affilia- | Achieve- 
tion ment Power 
| Sex Picture |Imagery | Imagery Imagery 
Men 
(N 597) 
1 3% 32% 11% 
2 16 23 14 
3 17 2 38 
4 53 13 T 
5 33 13 30 
6 33 3 22 
h 
| Average Raw Score* 4.85 2.90 2.84 
| Standard Deviation 3.65 3.29 2.13 
y Women 
N=774) 
1 2% 45% 10% 
2 56 3 26 
3 44 10 6 
4 6 36 1 
5 40 30 6 
6 35 1 25 
pe 
Average Raw Score" 5.31 3.97 1.84 
Standard Deviation 3.54 3.30 2.19 
a Scores based on manuals and procedures in Atkinson (1958, 
(f Chs. 12, 13, 14). | 
The category, ‘Unrelated Imagery" was scored 0 instead 


of —1. Scores obtained using 0 and —1 correlate .96. 


life situations in which n Achievement, n 
Affiliation, and n Power are normally ex- 
pressed, The differences among average 
scores can more appropriately be used as an 
index of the relative sensitivity of the pres- 
ent pictures to each of the motives and as а 
guide for improving the selection of pictures 
in subsequent studies. 

In light of Haber and Alpert’s (1958) 
definition of a strongly cued picture as one 
which elicits a particular kind of motiva- 
tional imagery from 75% or more of the Ss 
in a sample and a weakly cued picture as 
one which elicits imagery from between 
25% and 50% of Ss, we must conclude 
that the two six-picture forms employed in 
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the present study are weakly cued for each 
of the three motives. None of the pictures 
can be considered strongly cued for any 
motive; and no more than four pictures in 
the set fall in the range Haber and Alpert 
designate as weakly cued for a given mo- 
tive. Consequently, in the present study there 
should be fewer "false positives," which 
often occur when strongly cued pictures are 
used. On the other hand, the relatively low 
frequency of response has two important 
negative implications. First, scores from 
weakly cued pictures are necessarily less 
variable and therefore by themselves have 
less potential for generating strong relation- 
ships with other variables. Secondly, low 
average frequency of response to pictures 
is one of several considerations! that pre- 
clude the possibility of a meaningful split- 
half analysis of the reliability of the motive 
scores in this study. А split-half estimate 
of the reliability of the present measures is 
considered inappropriate because the aver- 
age frequency of response is so low that 
even moderately motivated S's often respond 
to only one or two pictures of the set. 


Control of the Interview Setting for the 
Projective Measures of Motivation 


In experimental studies using projective 
measures of strength of motivation, the ex- 
perimenter controls the amount of time the 


4 Earlier systematic split-half and test-retest esti- 
mates have yielded coefficients as high as .78 (Mc- 
Clelland, et al, 1953) and .74 when all the pictures 
were strongly cued for n Achievement and .53 
(Haber & Alpert, 1958, p. 661) when all the pic- 
tures fell in the 25-50% (weakly cued) range for 
n Achievement. In these systematic studies of reli- 
ability, care was taken to construct subsets of pic- 
tures having qualitatively as well as quantitatively 
similar cues. In the present study the six pictures 
were not deliberately selected with an eye to having 
two equivalent subsets of three pictures with sim- 
ilar content for each motive. Rather, the choice of 
pictures was guided by just the opposite orientation 
of (a) presenting a variety of nonoverlapping life 
situations in which (b) three different motives 
might be expressed. For a fuller discussion of 
reliability of motivation scores, as distinct from 
reliability of coding which is taken up at length in 
a subsequent section, see Atkinson (1958), Haber 
and Alpert (1958), McClelland (1958), and Reit- 
man and Atkinson (1958). 
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5 spends telling a story. When written 
forms of the test are employed, each S is 
given a booklet containing a sheet of paper 
for each story. On each page are four sets 
of questions, adapted from Murray (1953), 
which help to remind S to cover all ele- 
ments of a plot: 
What is happening? Who are the persons? 


What had led up to this situation? That is, what 
has happened in the past? 


What is being thought? What is wanted? By 
whom? 


What will happen? What will be done? 


Recognizing that we could not ask a great 
many of the potential respondents in a na- 
tional survey to write out the story, we were 
confronted with the task of fitting a proce- 
dure that was as similar as possible to the 
one above into the interview setting. We 
had to face the fact that we would be deal- 
ing with respondents who differed greatly 
in verbal fluency, with an individual admin- 
istration rather than the usual group admin- 
istration,® and with interviewers having dif- 
ferent tendencies for Probing. How could 
we command control over these differences 
to provide measures of motivation that 
would be comparable to those obtained in 
well-controlled experimental settings ? 


„Pretest already described, we ex- 
plored different Wordings of the leading 
questions and changed th 


(Who is thi ? 
What are they (is he/she) doing? ov Rust peston A) 


What do they (does he/she 
feel? 


я L4 " 
viewer had a set of 4” x 6" pictures which 


5 Lindzey and Heinemann (1955) 
from research studies of individ 
administered TATs that they 
equivalent. 


have concluded 
ual and group 
are reasonably 


he was to show to the S. The following 
excerpts from the detailed instructions given 
to the interviewers? reveal our decisions 
(a) not to make this a completely spon- 
taneous story-telling situation but to have 
the interviewer ask the plot-guiding ques- 
tions, and (5) not to allow the interviewers 
to engage in the spontaneous probing they 
usually engage in with open-ended questions 
but rather to be very specific about the coni 
ditions under which probing was permis 
sible and what these probes should be : 
Instructions to Interviewers 


In response to direct questions about himself, { 
person often cannot give you an accurate pict a 
the things he is most concerned about. It has ore 
found, however, that for some purposes ps of 
accurate picture can be obtained from the kin This 
Stories he makes up in response to pictures. do 
story-telling procedure, then, is an attempt wi. 
just that. In order to be able to compare p or- 
on the kinds of stories they tell, it is very E 
tant that each interviewer follow the same Dicit 
dure. That is why we are giving you the exP 
and detailed instructions that follow: 

l. Read the instructions for the story-tel 
procedure to the respondent exactly as they 
written on the questionnaire. pout 

2. Show the R the Picture and give him @ ask 
20 seconds to look at the picture before УО ее 
him the questions that go with each picture от 
the picture in front of him as he tells the e E 
..3. There is one page for each story to be the 

he same questions appear on each page. AS the 
questions in order for each story. Keep о et 
wording of the questions, and ask all of them 
if it may seem repetitious in some instances: 


4. Problems that you may encounter and #7 
do about them: 
" а. If the person rejects the whole tas 
о the bat by saying "I can't do this" or 
S no Imagination," resort to something ant. 
ell, just try to make up anything you er^ 
emember there are no right or wrong answ' the 
b. If, in reply to one of, the questions p 
Story form, R says "I've already told you 


ling 
e 


at (0 


Е right 
^ just 
like: 

t. 


—— 


6 ; ‚ ed 
АП interviewers were those regularly employ, 


members of the Field Staff at the Survey ues 
zenter. In addition to receiving written ins cost 
tions, two-thirds of the interviewers receive i 
communications about problems arising in t e D 
terviewing Procedure at regional conferences $ 
around the country, обшо 
_ ‘This is a departure from laboratory prot -onds 
in which a picture is shown for only 20 $5 0107 
before а Story is told. This alteration w25 d: 
duced to maintain rapport in an interview St 
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you can say, "Can you tell me some more about 
that?" 


c. When a respondent says “1 don't know" in 
reply to any of the questions about the pictures, 
again you might say, "You can make up anything 
you want. 'lhere are no right or wrong answers." 


Nore: We do not need detailed stories. АП we 
are interested in are responses which answer the 
four questions for each story. If К answers the 
question, even if only in a couple of words, do not 
probe. Probing should come only in the cases 
mentioned above (a, b, с). 


utes. Most people will give you brief stories, but 
if a person begins to give a lot of detail, remind 
him before the next story that he doesn’t have to 
give long stories. 

The collection of thematic apperceptive 
stories was accomplished in the context of a 
study conducted for the Joint Commission 
on Mental Illness and Health (Gurin, 
Veroff, & Feld, 1960). The larger inter- 
view dealt with personal feelings of adjust- 
ment and ways people manage to handle 
their adjustment problems. Previous re- 
search has conclusively shown that the con- 
tent of imaginative stories is highly suscep- 
tible to conditions immediately preceding 
the story-telling procedure. Indeed, the ex- 
perimental validity of the measures depends 
upon their sensitivity to situationally- 
induced motivation. Therefore, to approxi- 
mate a “neutral” test condition,’ the projec- 
tive measure appeared at the beginning of 
this personal interview. It was preceded 
only by a few questions about leisure-time 
activities, used to establish rapport and to 
avoid what might otherwise be too abrupt 
a beginning if the imaginative test was 
given first. Another survey study (Morgan, 
Snider, & Sobel, 1958) which included ex- 
actly the same measure of motivation but 
at the end of the interview (because of spe- 
cial rapport problems in that interview situ- 
ation) found that the stories were consider- 
ably influenced by the specific nature of the 
material discussed in the questionnaire be- 


8 In experimental studies of students, 4 minutes 
are normally allowed for each story. 

9A "neutral" test condition is one in which the 
experimenter makes no deliberate attempt to arouse 
or relax а particular kind of motivation before 
administration of the test. 


fore the thematic apperceptive procedure 
was introduced. 

The rationale given to respondents for 
introducing the novel story-telling proce- 
dure in a survey about “modern living” 
was: 

Another thing we want to find out is what people 
think of situations that may come up in life, I’m 
going to show you some pictures of these situations 
and ask you to think of stories to go with them. 
The situations won't be clearly one thing or 
another—so feel free to think of any story you 
want to. 

For example, here is the first picture. Га like 
you to spend a few moments thinking of a story to 
go with it. To get at the story you're thinking of 
ТИ ask you questions like: Who are these people? 
What do they want? and so on. Just answer with 
anything that comes to mind. There are no right 
or wrong answers. 

The interviewers reported that по major 
difficulty arose with this testing procedure. 
Many of them commented that the respond- 
ents did not appear disturbed by this novel 
interviewing procedure and, in fact, seemed 
to enjoy it. The major question we must 
ask about the effectiveness of the procedure, 
however, is whether enough standardization 
of interviewing was established to rule out 
possible interviewer effects on the measures 
of motivation obtained from the stories. 
Would different interviewers testing the 
same population elicit stories that were 
similar in motivational content? This 
problem is treated in a later section. 


Coding the Stories 


Training the Coders. One of the most 
important methodological contributions of 
previous work on these projective measures 
of motivation has been the continued con- 
cern of research workers with attaining 
high coding reliability. Feld and Smith 
(1958) have evaluated the training proce- 
dures used to teach novices the methods of 
content analysis for the three motives and 
report that conscientious use of the training 
materials they describe enables coders to 
score stories for n Achievement, n Affilia- 


10 Birney (1956) has reported evidence of an 
"experimenter effect" on the level of achievement- 
related content in stories written by college 
students. 
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tion, and n Power with reliabilities accept- 
able for research purposes. They review 
interjudge reliabilities reported in 14 pub- 
lished studies using the method of content 
analysis to be employed in this analysis and 
find scoring reliabilities ranging from .66 to 
-96 with a median of .89, They also report 
interjudge reliabilities ranging from .73 to 
2 with a median of .87 for 12 novice 
coders who had just finished learning how 
to score a set of familiar pictures using 
their training procedures (ie., after about 
12 hours of independent practice). This 
survey of the degree of objectivity in cod- 
ing obtained in 
Studies provides 


ventions not treated 
Feld and Smith (1958 


re for each S. Asa 
las of total scores at- 
ng idiosyncracies ofa 
tther experience was 
Pictures for 


et al., 1958). Check- 


Coding was 


sample of 240 stories from 40 Ss by two 
other coders!! who had previously been 
checked out as "expert" in motive SOBRE. 
Scoring reliabilities from this “warm up 

check-coding, and for others reported in 
this paper were computed in two ways. First, 
we estimated the degree of agreement 11 
noting the presence of any achievement, 
affiliation, or power imagery in the story— 
the single most important scoring ани 
for the coder. This agreement is expresse 
аз а percentage (twice the number of agree 
ments on the presence of тагына, 
imagery divided by the sum of the tota 
number of times motivational imagery was 
coded by the two coders). The second esti- 
mate of the coding reliability was the rank- 
order correlation between motivation scores 
based on all six stories coded first by the 
three coders and then by the check-coding 
experts. Although the individual percent- 
ages of agreement on “imagery” (ranging 
from 50% to 90%) were not consistently 
as high as the customary agreements foun 
in experimental research, we felt that the 
total score reliabilities (.74, .76, 
showed that these coders had attained a pon 
gree of proficiency which, augmented D) 
further clarification of scoring problems: 
Was sufficient t 
the stories in t 


_ Assessing Coding Reliability of Motiva- 
ton Scoring in the National Survey Sam- 
ple. The nine coders, each a specialist ОП 
from the male form ай 


ts ‘SSPondents’ protocols (OF 
3,238 Stories), The Criteria for scoring OF 

8 а particular 
in the ауа; 


plus Specific сопу 
pictures, 


, Each coder's total scoring task was di 

vided into three nearly equal sectio? 
(about 1.080 Stories to be scored with” 
each section). Furthermore, each code 
—— 


11 Two of th 


h 
. € authors — Sheila Feld and JoseP 
Veroff. 
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different order, 


scored the protocols in а 
from the same 5 


never scoring two stories 


9 


tained from six stories. After each check- 
coding, differences between coder and 
"expert" were discussed. The three reli- 


‚ in sequence. These three sections each 
represents a coding period for which check- ability checks provided an opportunity to 
` coding procedures were done. Within each folow the progress of coding reliability 
м third of the coding, a set of protocols were over the period of several months which it 
_ designated as reliability checks. The par- took each student coder, working part-time 
` ‘ticular stories that were to be coded by the on an hourly basis, to complete approxi 
"expert coder" were unknown to the coders. mately 100 hours of coding. The resulting 
From these protocols we estimated the reli- S aere it oc a total score 
, abilities, again both as percentage of agree- bs ih end i 5 che go T VÉ periods of 
ment in the coding of the presence of moti- a ot а Lt ё ж | 
vational imagery in the stories, and as dinis ke otis ru E 
rank-order lations of total scores ob- ташу E Е 
nk-order correlat of coding. There are occasional shifts up- 
—— ward and downward, but there are no con- 
12 For a given motive each S’s stories were sistent trends indicating any drastic effects 
y scored in such a way that two were scored by опе of coding over a prolonged period. It is our 


coder during the first third of coding, two in the 
next third by the next coder, and two in the last 
by the third coder. In this way any systematic 
bias of coding attributable to time of coding would 
be randomized over the Ss. The exception to this 
procedure is the set of stories in the reliability 
checks, All of the stories of these Ss were coded 
during the same period of coding. 


opinion that the downward trends in reli- 
ability often seemed to reflect the “expert” 
coder’s lack of adaptation to these particu- 
lar pictures. During the course of coding, 
certain specific conventions were adopted 
to establish scoring criteria for these novel 


TABLE 2 


SconiNG RELIABILITIES FOR TH 


E NINE CODERS AND THE THREE MOTIVATION SCORES 
NATIONAL SAMPLE SURVEY 


p IN THE Г 
Percentage of Imagery Agreement Total Score Reliabilities 
Motive (60 Stories) (30 Ss) 
" and Pictures 
| Codes Check | Check | Check | Check | Check | Check 
1 2 3 1 2 $ 
b 
n Achievement 2 .89 TT .81 
Coder 1 1,6 8n 5 90 
Coder 2 3,4 71 75 63 
Coder 3 2;5 87 55 82 
№ 
^ п Affiliation .91 .76 .16 
Coder 1 4,5 80 80 89 
Coder 2 2,6 83 89 78 
Coder 3 1,3 80 73 80 
Power ° ‚12 13 
V R Coder 1 1,5 ә 75 67 
Coder 2 3,6 о 73 77 
Coder 3 2,4 i: ТТ 84 
То be read Picture 1 and Picture 6 on male form and on female form. 
LI i 8 
k-ord: correlations. s ө ей ct fi 8 
: Баит ed pour to those attained in the second and third reliability checks for n Power СОЕ ced their original 


si ` ati 
дейген e “check coding sheets were misplaced and cannot be re-estimat 


б e The е because the 
unavailable ы direction of the consensus after discussion. 


n codings in 
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material. The coders were more adapted 
to these changes than the expert coders, 
who, scoring only a sample of protocols 
from time to time, tended to be somewhat 
less consistent in decisions about "novel" 
material. 

The extent to which this is true can be 
evaluated in light of a second Set of reli- 
ability coefücients that were computed after 
the coder and the expert check-coder had 
discussed every instance of disagreement 
and had come to а Consensus of opinion 
about final Coding. Correlations between 
the coders’ original scorings and this con- 


discussion score reliabilities for 


5 first, sec- 
ond, and third Checks were: n Achievement 
—91, .94, S88; п Affiliation— 94 84, 81 


Consequently, we vi 
efficients reported in Т, 


All of 
(median — 7) 
In published experimenta] s 


ported in smaller-scale 
Considering the antici- 


As 
coders were highly aw. 


4 1 are of certain idio. 
Syncratic coding соп чо 


they had 


18 First reliability estimate on Я 
unavailable (cf. Footnote ©, Table 2) Power coding 


adopted—some of which they eventually 
had to change—but some of which helped 
us get a better picture of how the scoring 
criteria apply to the stories told by this 
kind of broad population sample in an inter- 
view setting. These “new” conventions have 
been integrated into scoring manuals for 
this set of pictures! 


Correcting Motivation Scores for 
Variations in Length of Protocol 


The correlation between motivation 
Scores and length of protocol has p 
found negligible in studies with college p 
dents when 4 minutes are allowed At 
writing stories ( Atkinson, 1950, p. 27). n 
least four Studies, however, have e 
significant relationships, requiring some pé 
of correction when somewhat different be 
conditions are employed. Ricciuti on 
and Ricciuti and Sadacca (1955) found c n 
relations ranging from 48 to .59 between 5 
Achievement scores and number of M 
in the brief protocols of high school es 
dents when no Structuring questions | 
asked and when only 2.5 minutes b 
allowed for writing stories. Walker fo 
Atkinson (1958) found a correlation ^ 
between length of protocol and x 
related motivation Scores in а very hete as 
Beneous sample of Soldiers when there ki 
less than adequate control over the ас т, 
time spent Writing stories, Child, T 
and Verofr (1958) found it nm 
to correct achievement scores obtained f° - 
folk tales to remove the difference 2 


: n 
tributable to length of story, We had reas? 
to anticipate 


Wo Separate problems were anticipate, 
(a) how to eliminate protocols because ^ 
inadequate Tesponses, and (5) how to сеп 
rect for anticipated correlations betW 
— 


h 
B у 
14 Available on Tequest from Survey Ве "c 
Center (Attention: Dr. Joseph Veroff), Univ 


of Michigan, Ann Arbor, Michigan. 
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motivation scores and length of stories т 
protocols otherwise judged to be adequate 
for further analysis. 

Elimination of Protocols. At the outset 
of scoring the protocols it was clear that 
some Ss gave responses which had to be 
considered inadequate for arriving at valid 
motivation scores. А few Ss tended to re- 
ject the whole undertaking, some refused to 
give stories to some of the pictures, some 
were unable or unwilling to give answers to 
certain questions within a story. In order 
to deal with this problem, the coders were 
instructed to keep a detailed record of "in- 
adequate responses," 1.е., not answering cer- 
tain questions in a Story, saying "I don't 
know," or "It's hard to tell,” or the like. 
For апу 5 there are 24 possibilities for an 
inadequate response—four questions on 
each of six stories. Table 3 shows the fre- 
quency distribution of inadequate responses 
in terms of the criterion for elimination of 
protocols that was employed. 

What does an inadequate response mean 
for assessment of motivation? It means 
that the person gave no imaginative content 
that could possibly be scored for motiva- 
tional content. The person, for whatever 
reason, has prevented the measuring instru- 
ment from being applied. If a person Te- 
jects the whole task (24 inadequate re- 
sponses) his protocols are obviously not 
suitable for scoring. Even if the person re- 
jects the task for only one picture (i.e., four 
inadequate responses in only one story), his 
Protocols should not be considered, for he 
has not taken the complete test. Had he 
responded to that particular picture, his 
story might have been saturated with a par- 
ticular kind of motivational imagery. But 
what of the other possibilities? Neglecting 
to answer one question in a few stories, 
neglecting to answer two questions in one 
story, and so on? In Table 3 we have 
grouped the various possibilities of inade- 
quate responses according to whether they 
are judged to be omissions of minor signfi- 
cance, possibly serious, or serious. The 
judgment of the seriousness of omission 
was made on the basis of the extent to 
which each kind of omission might poten- 
tially affect the total motivation score. We 


TABLE 3 


FREQUENCY OF INADEQUATE RESPONSES TO ONE OR 
MORE or THE Four QUESTIONS ASKED TO GUIDE 
THE Story Того ABOUT ЕАСН OF Six PICTURES 


Men 
(N=715) 


Women 
(№=904) 


Adequacy of Responses 


All stories complete 491 654 
Omissions Judged to Be of 
Minor Significance 
One story has 1 inade- 
quate response 73 89 
Two stories have 1 in- 
adequate response 23 23 
Three stories have 1 
inadequate response 10 8 
Omissions Judged to Be 
Possibly Serious 
More than 3 stories 
have 1 inadequate 
response 
One story has 2 inade- 
quate responses 35 32 
Omissions Judged to Be 
Serious 
Two or more stories 
have 2 inadequate 
responses А 14 15 
One or more stories 
has 3 inadequate 
responses 25 39 
One or more (but not 
all) stories have in- 
adequate responses 20 26 
All 6 stories have 4 in- 
adequate responses 18 11 


considered this question: Had the respond- 
ent not made such omissions on his proto- 
cols, could his potential total score have 
been appreciably higher? Not giving a re- 
sponse to one question in a story does not 
subtract appreciably from a person's po- 
tential total score for that motive. Four 
such omissions might make an appreciable 
difference, and were therefore classified as 
possibly serious. 

In light of our experience with practices 
informally employed in experimental studies 
we eliminated Ss judged to have possibly 
serious or serious omissions. Thus an 575 
protocols were judged to be adequate when 
at least half the stories were complete 
and no more than one question was omitted 
from any or all of the remaining stories. 
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This criterion led to the elimination of 
17% oi the men and 14% of the women. 
Two things should be kept in mind in ap- 
praising the significance of this number of 
rejected protocols. First, we deliberately 
adopted a fairly stringent criterion for elim- 
ination. Secondly, these Ss were eliminated 
not on the basis of just one set of responses 
to one picture, but on the basis of inade- 
quacy of response on any one of six pic- 
tures. Consequently, we judge that the fre- 
quency of inadequate protocols is not out of 
line with the frequency of inadequate re- 
sponses that would be obtained from any 
complex open-ended questionnaire in а па- 
tonwide survey setting if we were to 
eliminate Ss giving an inadequate response 


on any one of a set of six complex open- 
ended questions. 


One very important question to ask about 
this elimination procedure is whether it has 
eliminated Ss selectively from particular 
Social groups. The Psychological variable 
expected to have the most important bearing 
on ability to answer the questions about pic- 
tures is verbal fluency. The demographic 
Characteristic most related to this variable 
18 education of the respondent. Table 4 
shows the relationship between level of edu- 
cation and percentage loss of data due to 


TABLE 4 
Tue RELATI 


ONSHIP OF EDUCATION To PERCEN. 
ee PERCENTAGE 


BJECTS Duk то INADEQUATE PROTOCOLS 


Men Women 
Educational Level N | Рег. | N Рег- 
т |cent-| in cent- 
Sam- | age Sam- age 
ple“ | Loss ple* | Loss 
No schooling; grade 
school (1-6 years) 107 
Grade school ш е n 
(7-8 years) 138 9; 
High school ~ | a9 " 
(9-11 years) 139 12 
Completed high School| 169 12 ры * 
Attended college 157 7 136 : 
2 Does not inc! 


lude Ss whos i 
" * зе е 
destained, ducational level Was not as- 


inadequate protocols. The loss of data 15 
most severe in the least educated groups. 
Of those who have little or no зета 
nearly one-third of the sample of men апі 
women are rejected from further analysis, 
What these data suggest, therefore, is ae 
the measuring device as here employed _ 
а survey setting is less appropriate for E 
tain segments of the population perm 
larly among the least educated groups. "- 
is hardly a startling result. Practically е 
verbal measure runs into the same duce 
Inadequate responses to open-ended yir 
questions tend to stem largely from me 
cated groups ; inadequate answers ET 
sets) on certain forced choice РИ 
also are most apparent in uneducated 810 
( Christie, 1958). #89 
In the future use of these me 
therefore, one has to realize that а но 
population responding to them appropri? a 
will be biased. Any variables pase у 
with education will have biased represe 
tion. Further examination of our data a 
indicates that the loss of Ss is heaviest i 
the low-income groups, the оссире a 
groups of lower status (unskilled WO" 94) 
in particular), and the older age levels: he 
of these findings are consistent wit? ese 
major finding on education, since дер!” 
other groups that suffer the largest heavy 
tion in Ss are groups that have а 


| Nn аиса” 
Proportion of respondents with little € 
tion. 


150 


а 
We suspect that a failure to prod". 
meaningful protocol may also reflect ation 
clencies in the kinds of social mot" ar 
that are being assessed in this study P. ү 
ticularly achievement motivation. He" 1 of 
may be Overestimating the average 16% real 
motivation in апу group from which 2 iri 
many Ss are removed because of oe pee 
adequate Protocols. This additional P fal? 
bility argues against using subsequent pul? 
from this analysis to estimate а : йб 
norms in specific groups, but it dae ci 
нае the possibility of comparing ° рсе 
groups оп average motivation scores. psti 
We recognize that we may have bias¢ 


mates in certain groups. weet 
Correction for Correlation бе گار‎ 
Length of Stories in Adequate Pr 


я 


кы 
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TABLE 5 


NUMBER or WORDS IN SIX STORIES FOR 
PROTOCOLS JUDGED ADEQUATE 


Number of words Men Women 
(№=597) | (N=774) 

80-169 32 40 
170-229 117 171 
230-289 147 215 
290-349 127 157 
350-409 77 101 
410-469 45 49 
470-529 23 21 
530-589 п 10 
590-649 11 7 
650 and above 7 3 
Median 291 279 


and Motivation Scores. The number of 
words in protocols accepted for 597 men 
and 774 women ranged from 80 to 823 as 
shown in Table 5. The median number of 
words was 291 for men and 279 for women, 
or about 46-48 words per story. This is 
exactly the same length obtained by Ric- 
ciuti and Sadacca (1955) from 2.5-minute 
stories written by high school students, but 
is approximately half the length usually 


obtained in studies with college students 


when 4 minutes are allotted for writing each 
story. Although there is no way to compare 
directly the time needed to write and to 
tell stories, one may note that the approxi- 
mate time allotted for each story in the 
instructions given interviewers (3 to 3.5 
minutes) is less than the 4 minutes normally 
allotted in studies of college students. _ 
The raw n Achievement, n Affiliation, 
and n Power scores? obtained from the 
content analysis were correlated with length 
of protocol separately for men and women. 
The results are shown in Table 6. The re- 
sults show the anticipated significant rela- 
tionships between length of protocol and 
raw motive scores, product-moment corre- 
lations ranging between 21 and .28. Clearly 


s obtained by summing the fre- 
ar kinds of motivational content 
f the six stories (see Atkinson, 


15 A total score i 
quencies of particul 
appearing m each о 


1958, Part П). 


some correction of the raw scores is neces- 
sary to eliminate the obvious fact that a 
person who tells longer stories stands a 
greater chance of obtaining a high score 
based on a frequency count of particular 
kinds of motivational content in what he 
has said. 

Before applying correction procedures, 
we simplified our considerations of the mo- 
tivation scores by putting the п Achieve- 
ment, n Affiliation, and n Power scores on 
the same scale within each sex. There is no 
legitimate basis for making direct com- 
parisons of raw scores between men and 
women on a particular motive, or between 
the raw motivation scores within each sex. 
The varying means and standard deviations 
of raw scores, for each sex and on each 
motive, are partly a function of the pictures 
cues, as already pointed out. Putting each 
motive distribution on the same scale with 
the same means and standard deviations 
allows for certain comparisons. This was 
done by assigning percentile ranks to each 
motivation score (for men and women sep- 
arately). When a number of persons had 
the same score, the median percentile rank 
of the interval covered was assigned to all 
tied scores. 

Fach distribution of percentile ranks was 
normalized. Percentile ranks were con- 
verted to normal deviate scores based on a 
mean of 50 and a standard deviation of 10 
(T scores). As before each sex was con- 
sidered separately. 

These T score conversions of percentile 
ranks effectively remove the differences be- 
tween average motivation scores for both 


TABLE 6 


Propuct-MoMENT CORRELATIONS BETWEEN Raw 
MOTIVATION SCORES AND LENGTH OF PROTOCOL 


Motive Men Women 
(№=597) | (N=774) 
n Achievement .28 .25 
n Affiliation .20 .24 
n Power 21 .21 


Note.—All р values <.0001. 
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men and women,!? but they do not correct 
for the influence of length of protocol. The 
correlations between length of protocol and 
T scores are all within —.01 of the original 
correlations between raw scores and length 
of protocol. 

The question as to how to remove the 
effect of length of protocol was resolved in 
light of the following argument. The need 
for some kind of correction is obvious. A 
person's chance of getting а higher score on 
any motive is greater the longer the story 
he tells; the greater the amount of imagery 
he produces in an imaginative Story, the 
greater the chance of it being coded as evi- 
dence of some motive. Theoretically, the 
best possible correction would be one based 
on the average of correlations between 
Scores for many kinds of motives and the 
length of protocol. This average correlation 


ematic relationship 
ol and score on any 
motivation, e.g., n 
eaningful determi- 


between some other {ур 


es of motivation and 
length of protocol. 


If we were to u 

ә se the 
Correlation between n Achievement Scores 
and length of protocol to Correct scores for 
n Achievement, it might obscure 


motiva- 


16 2908 ано significant diffe 
raw n Ас ievement, n Affliati 
scores both within and RE = $ нт 
readily explained in terms of the slight differences 
in degree to which the Sets of pictures “su, ie 
each of the three motivational concerns те din í d 
desirable to convert these Scores immediate] tc th 
same scale, ie, percentile tank, to avoid me 
misinterpretation of the meaning of different و‎ 
age raw scores. 


Tences among the 


In this study, the stories have thus far 
been coded for only three motives. Hence 
the average of the three correlations рго- 
vides the nearest estimate that can be made 
of the extent to which length of story тийш 
ences opportunity to get а high score. т 2 
average of the three correlations is .22 fo 
men and .24 for women. In each case, this 
average correlation was used to cime 
regression line of motivation scores. 3 
length of protocol! The average ia 
pected" motivation score was Челн 
for each length-of-protocol interval. 24 
average expected score at each interval W: Д 
then subtracted from the obtained anne 
score for the whole sample on each a 
separately. This difference score ab 
for each motive separately) was then dul 
as а constant correction for all indivi Ej 
Scores within particular intervals of leng М 
Thus, а correction factor was adde is 
Scores that were too low because proton 
Were short, and a correction factor W h 
subtracted from scores that were too № 
because of very lengthy protocols. E 

The effect of this correction was tO in 
move the systematic expected difference h 
motivation scores as а function of the lem 
of protocol. In Table 7 are presente 
average Т scores for n Achievement in ™ 


. or 
as a function of length of protocol о. 
and after the correction described rable 


was accomplished. We see in this hip 
that the corrections remove the relations d 
between mean п Achievement scores Kn 
length of protocol, Similar effects an Ё 
tained with Women on achievement mot! he 
tion and with both men and women ОП the 
Other motives. The over-all effects © 
Corrections аге summarized in Та (S 
Which reports the existing correlations 
Se + eg tt 
17 We Suspect that iese diret motives pe 
Achievement, n Affiliation, n Power—all mi? of 
expected to be more highly related to lent er 


" M 1 
Protocol given the interpersonal nature of the (e£ 
view, than some other kinds of motivation gsi 


0 
hunger). Hence we are well aware of the Per- 
bility that the correction employed may ftw? 
Correct" th 0 


Я This seemed the lesser that А 
evils given the compelling a priori case f en 
longer Protocol should increase the chance оопа! 
Score on any type of motiv 
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TABLE 7 
ExAMPLE OF CORRECTION APPLIED TO UNCORRECTED № ACHIEVEMENT T Scores TO REMOVE THE 
CORRELATION BETWEEN THESE SCORES AND LENGTH OF PROTOCOL 
(Males Only) 
е 
Number of Mean Uncorrected Mean Corrected 
Words in N n Achievement Correction? n Achievement 
Protocol Score* Score 
80-169 32 45.75 3 48.75 
170-229 117 47.95 2 49.95 
230-289 147 50.04 1 51.04 
290-349 127 49.96 0 49.96 
350-409 77 52.90 —1 51.90 
410-469 45 53.58 —2 51.58 
470-529 23 52.96 —4 48.96 
530-589 11 50.09 -5 45.09 
590-649 11 55.82 —6 49.82 
650 and above 7 56.43 —17 49.43 
a Scores represent normal deviate equivalents using the T score conversion of percentile ranks, the mean is 50 and standard 
deviation is 10. 
b Correction factor obtained from the regression line based on the average of correlations between n Achievement, n Affiliation, 
and n Power scores and length of protocol. Correction factor equals mean n Achievement score of the whole sample minus mean 

ГА "expected" п Achievement score for protocols of particular length. 
tween length of protocol and the final cor- between a given variable and motivation 
rected motivation scores. These correlations score (because both variables were related 
are all low and insignificant. The final cor- to verbal fluency), there should no longer 
rected scores, no longer biased by length of be any relationship. Where a relationship 

„ protocol, will henceforth be referred to as between motivation score and a variable 
the n Achievement, n Affiliation, and n might previously have been obscured arti- 
Power scores in all further analyses. factually, there might now appear a signifi- 

What effect will this correction of moti- cant relationship. 

А vation scores for length of protocol have on То illustrate some of the potential effects 
subsequent analyses? The relationship of of removing the influence of length of pro- 
motivation scores to any variable which is tocol, we have examined the relationship 
in turn highly related to verbal fluency, between education and the three motivation 
should now be uncontaminated by the factor scores before and after the correction was 
of verbal fluency. Where previously there made. We again selected educational status 
might have been an artifactual relationship as a crucial example, because it should rep- 

resent that variable which is most likely to 
reflect differences in verbal fluency. Re- 
TABLE 3 + 
proc spondents were classified according to edu- 
Pnopucr-MoMENT CORRELATIONS BETWEEN cation as follows: grade school education 


CORRECTED MOTIVATION Т SCORES AND 
LENGTH OF PROTOCOLS 


f Motive Men Women 
(N=597) | (N=774) 

n Achievement .05 .02 

n Affiliation .00 .00 

n Power —.01 —.04 


Note—All correlations are insignificant. 


only or less, some high school education but 
no college, and some college education. 
When we first relate education to length 
of protocol (Table 9) we find, as expected, 
that there are substantial differences be- 
tween the educational groupings in average 
length of protocols. Among men, college- 
educated Ss produce longer protocols than 
the two other educational groups; and 
among women, the college group is higher 
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TABLE 9 


MEAN LENGTH or PRorocoL (NUMBER OF Wonps) 
ACCORDING TO THE LEVEL OF EDUCATION 
ОЕ SUBJECTS 


Education Men Women 
(№=593)= | (N=771)* 
Grade school 256.4 240 8. 
High school 276.2 264.8 
College 303.8 290.04 


* Does not include Ss whose educational level was not 
ascertained. 


b High school mean higher than grade school mean (р < .05; 
1-tailed test). 


* College mean higher than high school or grade school mean 
(p < .05; 1-tailed test). 


4 College mean higher than high school mean (p «.01; 
1-tailed test). 


than the high school group which in turn is 
higher than the grade school group. 

From these results we are led to antici- 
pate that motivation scores will be related 
to education differently before and after 


Correction. Table 10 presents the mean n 
Achievement, n Affiliation, and n Power 
Scores for men in the three education 


groups before and 


after correction for dif- 
ferences in length 


of protocol. Table 11 
presents a similar comparison for females. 


„The correction does not h 
differential effect on th 


, however, the correction does 
make a difference, Among men, there is 
no significant difference between the n 
Achievement scores of high school and col- 
lege groups after correction where there 
was a significant difference before correc- 
tion. Among women, the significant differ- 
ence between the grade school group and 
the other groups in n Power Scores dis- 
appears after correction. The correction 
seems to have eliminated a possibly arti- 
factual difference in these two instances, 
The remaining significant differences in 
motivation score attributable to educational 
status are consistent before and after cor- 
rection. Evidently these are relationships 
between motivation and education which 
are independent of verbal fluency. We can 


at least, feel confident that the dieren 
obtained after correction are not attrıbu 
ble to differences in verbal fluency. 


The Effect of Interviewers on 
Motivation Scores 


Although we made a serious effort =. 
standardize the interviewing procedure, i 
realize that having a group of 159 map 
interviewers (27 men and 132 women) "nd 
lecting the protocols must necessarily Ad 
duce some variability into the measures 8 
has hitherto been minimal when all vr 
cols are collected at one time in à Leer 
administration. An estimate of the ys > 
of this interviewer variability will he e 
to gain some perspective concerning arts 
far the national survey interview pe uei 
from standardized administration рг 
dures. 

To establish such an estimate in аш 
data is а difficult problem, A valid tes 


TABLE 10 


ND 
MEAN N ACHIEVEMENT, N AFFILIATION: А трд 
N Power T Scores ог MALES BEFORE ao 
CORRECTION For VERBAL FLUENC 


Mean Score" 
Motive and ا‎ аа 
Education 
After 
Before tion 
Correction Corpos 
n Achievement 0 
Grade school 48.76 29.95 
High school 49.49 50 од» 
College 51.95» = 
n Affiliation 88 
Grade school 49.21 EA 22 
High school 49.97 50-27 
College 51.27 ЗЫ 
п Power | 8 
Grade school 50.97 m ў а 
High school 49.58 0.09 
College 50.23 d t 
ло 
was 371, 
* Does not include Ss whose educational level v2? 
colecained. Grade school N —176, high schoo pool 
college N = 146, high ** 
nd тше ean score significantly higher than pool 


eans (p < .05). ai 
© College mean score significantly higher than 8 
mean score (р < .05) 
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TABLE 11 


MEAN N ACHIEVEMENT, N AFFILIATION, AND 
х Power T SCORES OF FEMALES BEFORE AND AFTER 
CORRECTION FOR VERBAL FLUENCY 


Mean Score* 
Motive and 
Education 
Before After 
Correction Correction 
n Achievement 
Grade school 48.50% 49.465 
High school 50.41 51.18 
College 51.49 51.67 
n Affiliation 
Grade school 47.825 47.865 
High school 50.82 50.33 
College 51.57 50.43 
n Power 
Grade school 48.775 49.89 
High school 50.50 51.09 
College 51.56 51.47 


^ Does not include Ss whose educational level was mot 
ascertained. Grade school N =226, high school N =428, 
college № =127. 

Ъ Grade school mean score significantly 1 
school and college mean score (р < 05). 


ower than both high 


the effect of different interviewers requires 
random assignment from the population ot 
respondents {0 the interviewers. This is 
decidedly not the case with the Survey 
Research Center's field operation. АП inter- 
viewers are assigned respondents within 
their general geographical locality, and 
sometimes more specific locations are as- 
signed particular interviewers. Especially 
in a metropolitan area, there are gross dif- 
ferences between the sample of respondents 
interviewed by one interviewer contrasted 
with another: some interviewers specialize 
in low income, foreikn speaking, ог Negro 
areas. While the selection of respondents 
is completely determined at the central office 
of the Survey Research Center by proba- 
pility sampling techniques, these respond- 
ents not assigned to interviewers at 
random. To the degree that the assignment 
is not random, there may be systematic dif- 
ferences in responses obtained by various 
interviewers that stem from differences т 
f the samples interviewed. 


are 


the composition 0 


and not from differences attributable to the 
interviewers themselves. In spite of this 
danger of finding effects that are really at- 
tributable to respondents, we wanted to 
arrive at some estimate of interviewer 
effect. We therefore designed an analysis 
of “possible” interviewer effects which 
would be least subject to criticism that we 
had grossly confounded interviewers and 
characteristics of respondents. Our analysis 
undoubtedly still can be criticized on this 
score. But in any case, a violation of the 
assumption that the respondents of each 
interviewer represent a random sample of 
the population should in most cases con- 
tribute to an overestimation of interviewer 
effects. As a result the estimates to be de- 
scribed can probably be viewed as maximal 
estimates of interviewer differences. 

We have examined possible interviewer 
effects in a limited sample of the total popu- 
lation of interviewers. The limitations in- 
troduced to approximate the assumption of 
equivalence among respondents for each 
interviewer were: 

1. Interviewers to be considered should be white 
females. We eliminated from analysis all Negro 
and male interviewers because both are likely to 
be assigned to very specific populations, Negro 
interviewers to Negro residences and male inter- 
viewers to poorer residences. 

2. Interviewers to be considered should be doing 
their work in nonmetropolitan sampling points. 
The primary sampling unit (PSU) in this survey 
generally corresponds to counties in states. Some 
of these PSUs are metropolitan. As already 
stated, in metropolitan areas, there is a greater 
tendency for the interviewers to be selectively as- 
signed to various subpopulations. We avoid the 
grossest errors in meeting the assumption of ran- 
dom assignment of respondents by not considering 
metropolitan PSUs. 

3. Interviewers to be considered should have 
interviewed at least four male respondents and/or 
four female respondents who gave story protocols 
judged adequate for motivational analysis. A study 
of interviewers who interviewed fewer than four 
Ss would provide too unreliable an estimate of 
interviewer effect. 

4. Interviewers to be considered should be paired 
with a qualifying interviewer (not eliminated by 
Criteria 1, 2, and 3 above) from the same PSU. 
We imposed this limitation to allow for inter- 
viewer differences within PSUs. Our failure to 
meet the assumptions of random selection of re- 
spondents among interviewers should be somewhat 
compensated by the fact that we are dealing with 
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a relatively homogeneous (nonmetropolitan) popu- 
lation of respondents in which possible interviewer 
effects will be examined. 

Given these limitations, we were able to 
find 8 pairs of interviewers for male 5$ and 
15 pairs of interviewers for female Ss. Each 
pair represents a different PSU. We de- 
signed an analysis of variance of motiva- 
tion scores broken down into the following 
components: variance attributable to Ss (or 
individual differences), variance attributa- 
ble to interviewers, and variance attributable 
to PSUs. The procedure?? for the analysis 
is described by Anderson and Bancroft 
(1952). The component variance attributa- 
ble to interviewers is segregated from the S 
variance and its percentage contribution to 
S variability can be estimated. Separate 
analyses of variance were conducted for 
male and female Ss for each of the three 
motives. In these analyses, we estimate the 
significance of the variance assignable to 


18 We are indebted to 


h 1 Leslie Kish for his sug- 
gestions with procedure, 


interviewer effects and, in addition, we seg- 
regate the component variance for inter- 
viewers and examine the extent of its effect 
on score variability. Table 12 presents these 
analyses for men and Table 13 for women. 
Tables 12 and 13 show that a "possible 
interviewer effect on motivation Scores 18 
statistically significant in three instances: n 
Achievement scores for women, and n 
Power scores for both men and women. 
Furthermore, when component variance pt 
interviewers is specifically segregated, 1 
effect on score variability is estimated 45 
an intraclass correlation from which we ket 
derive the proportion of variance 1n ges 
vation scores possibly contributed by ШЗ) 
viewer differences ( Snedecor, 1946, р. 2 for 
These were negligible for men except in 
n Power where the percentage increase е 
variance was .29. Among women, the Pio: 
centage increase was negligible for n am й 
tion but .18 for n Achievement ай 
Power. Considering these estimates a 
maximal estimates, we feel safe in cone к: 
ing that we have achieved a fair degree 


TABLE 12 


ANALYSES OF VARIANCE OF MALES’ MOTIVATION SCORES 
(Ss within Interviewers within PSUs)» 


Source of Variation df MS F quia" Intraclos*, 
0 
ЖИЫ кь... Interviewers Correla" 
للا‎ 
n Achievement 
ee 1 96.77 
Я атат тшп PSU 8 97.78 1.38 09 
s within interviewer * . 6.81 
Total “a 70.42 
n Affiliation [o oM EE Lot. 
PSU 
шерине. within PSU d t 08 
5 within interviewer t 1.33 8.47 à 
Total E 101.27 
n Power L——. 
PSU 
Interviewer within PSU 1 220.00 29 
Ss within interviewer 48 £70.13 2.65% 26.51 : 
T s 64.06 
a Limited to cases of white, female inte: 


БЫР pF 


уь 
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TABLE 13 


ANALYSES OF VARIANCE OF FEMALES’ MOTIVATION SCORES 
(Ss within Interviewers within PSUs)* 


Component 
Source of Variation df MS F Variance for | Intraclass 
Interviewers | Correlation> 
n Achievement 
PSU 14 118.00 
Interviewer within PSU 15 174.40 1.90* 20.70 18 
Ss within interviewer 90 91.24 
Total 119 
n Affiliation 
PSU 14 90.14 
Interviewer within PSU 15 72.13 .88 —2.33 —.03 
Ss within interviewer 90 81.44 
Total 119 
Ё TU 14 86.14 
Interviewer within PSU 15 107.27 1.88* 12.61 .18 
Ss within interviewer 90 56.84 
Total 119 


a Limited to cases of white, female interviewers of 
measures, 

ъ Ratio of interviewer component variance to 

жр < .05. 


standardization in interviewing techniques 
for these measures of motivation. The three 
significant interviewer effects will be ex- 
amined in later work to explore their po- 
tential psychological significance. For the 
present we must recognize that we have 
some evidence of possible influence of inter- 
viewers on some motivation scores but ap- 
parently the influence is not substantial 


enough to warrant a great deal of concern. 


One reason these maximal estimates of 
tivation scores are 


interviewer effects on mo 
low is that the motivation scores have been 
corrected for length of protocol. Similar 
analyses of variance were run using number 
of words in the protocol as the dependent 
variable. Table 14 shows that for both men 
and women respondents there is a sub- 
stantial interviewer effect on length of pro- 
tocol. The effect is especially strong among 
male SS. The correction of motivation 
scores to eliminate the effect of differences 
in length of protocol has removed most of 
the possible interviewer effect оп these 


scores. 


nonmetropolitan PSUs who interviewed at least four Ss giving adequate 


S variance plus interviewer component variance. 


Over-all Evaluation of Methodological 
Problems Encountered 


These, then, are the methodological prob- 
lems encountered in the survey setting and 
our evaluations of the extent to which they 
remain as problems or as sources of error 
in subsequent uses of the thematic apper- 
ceptive measures of motivation. One prob- 
lem was solved—the bias of motivation 
scores attributable to verbal fluency. One 
problem appears very difficult to solve—the 
inadequate protocols produced by unedu- 
cated persons. A third problem should be 
resolved by further empirical analysis—pos- 
sible bias in the selection of pictures. The 
two remaining problems—lower coding reli- 
ability than in experimental studies and 
possible bias in motivation scores attributa- 
ble to interviewers—are ones that lower 
the reliability of measurement in subsequent 
analyses using the motivation scores. But 
both of these sources of error appear to be 
ones that certainly can be greatly reduced 
in subsequent studies. Experimental in- 
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TABLE 14 


ANALYSES OF VARIANCE ОЕ 
(Ss within 


MALES’ AND FEMALEs' LENGTH or PROTOCOLS 
Interviewers within PSUs): 


Component i ied 
ND ў 7 i f ntrac| 
Source of Variation df MS F LOU Correlation? 
For Males 
PSU 7 13.91 73 
Interviewer within PSU 8 12.92 12.99** 2.96 . 
Ss within interviewer 48 1.06 
Total 63 
For Females 
PSU 14 9.94 28 
Interviewer within PSU 15 3.50 2.00* 401 * 
Ss within interviewer 90 1.75 
Total 119 
ited to cases of white, fe: 


1 administration, а 
the desired length of Stories, 


INTERPRETATION OF Motivatioy 
If methodolo the sort 
ious Section 


sessment, cam. he ey i 
tribute to the stud. 
problems in surye 
methodological р 


© per sonality as- 
begiu to con- 


У research, y, 


"е attribute 


15 the issue Now to be 


The problem of interpretation of йү 
tion scores Obtained from thematic apP ; 
ception is not a new one ( Atkinson, к 
McClelland, et al., 1953) ; but the broa a 
setting of a national survey study EE 
ates the difficulties that are normally i 
countered even in fairly Ба ао 
experimental studies. Interpretive pnus at- 
are particularly highlighted, as we Wi we 
tempt to point up in this section, wh een 
examine the relationships between | ation 
graphic Characteristics and gene 
Scores. The national survey which = the 
ОВ to compare the motivation scores 1 tion 
different Subgroups of the larger pop ter- 
also serves to accent theoretical and | over 
Pretive problems that are often glossed 
in laboratory research. 


[4 
х у ce th 
Three factors are thought to influen! à 
frequency of 

} S el 
measure or 


the strength of a particular 


ОЁ motivation) . (a) individual di 
In strength о 

and enduring 
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motivation for particular goals to the extent 
that the situation portrayed is relevant to the 
life experience of the respondent; and, (c) 
the immediate life situation of the person 
may arouse or dampen motivation for par- 
ticular goals. The latter influence may be 
some factor in the immediate test situation 
of the respondent, or some more general 
but transient influence on his life situation, 
or some more general and longer lasting in- 
fluence on his life situation (e.g. constant 
achievement pressure of the job) at the 
time the measure is administered. Both 
pictures and a person's life situation provide 
cues that define expectations concerning 
what kinds of goals are relevant and reach- 
able, and how attractive it would be to 
reach the goals. 

In previous sections we have discussed 
the steps taken to minimize or control the 
influence of some situational and picture 
effects in order that motivation scores ob- 
tained in this survey could be interpreted 
with some confidence as manifestations of 
individual differences in strength of mo- 
tives. Many of these steps (1.е., control of 
the interview setting, attention to the type 
Of pictures employed, corrections for dif- 
ferences in length of protocols) do, un- 
doubtedly, enhance the possibility of view- 
ing the obtained scores as measures of 
differences in personality. Nevertheless, we 
believe this assumption should still be con- 
sidered tentative. 

While we attempted to remove biasing 
effects of pictures, we have no independent 
evidence that we have in fact removed all 
possible bias. For instance, can we assume 
that the pictures employed, which portray 
relatively young men and women in com- 
топ life situations, will have the same 
meaning for young, middle aged, and aged 
respondents? Furthermore, the standard- 
ization of the interview situation might 

control the immediate situational infimences 

at “he time the test was administered but 
obviously could have no effect on life condi- 
tions affecting the person at the time the 
survey was conducted. Can we assume, in 
advance, that the active young businessman 
is being tested under situational circum- 
stances that are comparable to those of an 
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older man enjoying retirement from busi- 
ness and the stress of day-to-day activities? 
Or must we conceive of the possibility that 
these gross differences in the total life situ- 
ation in which the administration of our 
test of motivation is embedded will produce 
effects comparable to those produced by ex- 
perimental manipulation of situational influ- 
ences in studies of motivation in college 
students ? 

Therefore when our results show that a 
particular kind of motivation is stronger in 
one social group than another, our task is 
not finished ; rather, the difficult task of in- 
terpretation is now thrust upon us. We 
may, in light of earlier research, assume 
that the combined effect of picture cues, 
situational pressures, and dispositional dif- 
ferences is greater in one group than in 
another. We have located a difference in 
motivation which now requires some kind 
of theoretical interpretation. The inference 
that the result means the two groups differ 
in strength of a general and enduring dis- 
position of personality requires the assump- 
tion, and supportive argument, that the 
effects of picture cues may be considered 
equivalent in the two groups and that the 
situational pressures for expression of the 
motive, whether transient or relatively en- 
during, are also equivalent. Often the argu- 
ments in support of one or another assump- 
tion will appear thoroughly convincing. 
For example, the assumption has generally 
been made in laboratory experiments with 
college students that in view of the relative 
homogeneity of both their present and past 
life conditions, differences in motivation 
scores could be assumed to reflect disposi- 
tional differences. Just as often, however, 
we shall find ourselves with a problem of 
interpretation which cannot be settled within 
the framework of the present results. In 
these cases, the identification of important 
motivational dite: s and the ambiguity 

wy he Wrterpretation will deime am TOGO 

tant problem for future research. 

It can be argued that certain life situa- 
tions can have lasting effects on a person's 
disposition so that “situational” differences 
become “personality” differences. Little is 
known concerning how permanent are the 
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effects of certain life experiences, particu- 
larly ones that occur later in life. The as- 
sumption of the primacy of early experi- 
ences in the generation of personality dis- 
positions has been widely accepted. Facing 
frankly the paucity of evidence concerning 
dispositional versus situational factors in 
motivation, and their interactions, we can 
see that any one of the following arguments 
might be pursued in the interpretation of 
obtained differences between certain social 
groups. 

1. The obtained difference is attributable 
to differences in enduring personality dis- 
positions acquired early in life. 

2. The obtained difference is the conse- 
quence of a change in Personality disposi- 
tion induced by important situational fac- 
tors later in life. 

3. The obtained difference js а conse- 
quence of exposure to differential tempo- 
тагу situational pressures affecting the level 
of aroused motivation regardless of equiva- 
lence in underlying personality disposition. 

Thus two problems remain that must be 
faced in confronting the substantive results 
Obtained with the thematic apperception 
Instrument: assessments of the bias of pic- 
ture Cues and the influence of the general 
life situa vation scores. 


tion on the moti 
These problems will be illustrated in the 
sults that follow. 


discussion of substantive re 


In Tables 15 to 22 ar 
the obtained FE 


n Affiliation, and n Po 


arise when we are finall 
the task of arriving at a valid interpretation 
of the motivation Scores; and Secondly, to 
provide researchers who 1 

particular social groups some descriptive 


information concerning potential differences 
in motivation among these groups. Under 
no circumstances should these data be con- 
sidered unequivocal norms about streng 
of motives in particular social groups unti 
the issue of the extent to which obtained 
differences are situational, dispositional, or 
artifactual is settled. Furthermore, under 
no circumstances should all of the oi 
ences presented in these tables be mu 
reliably different from one another. 
have highlighted some that illustrate e 
problem of interpreting motive scores, ei 
we have reserved the problem of statisto 
significance for the substantive appraisal o 
results in future articles. For readers Roms 
would like some bearings on the rp 
mate differences in percentages required = 
а statistically significant comparison e 
tween two independent groups, we Lois 
included in the Appendix a table of diffe 

ences required for significance (.05 pt 
with different sample sizes (Ns). ТІ i 
table is based on approximate sampling 
errors of differences in percentages. ae 
of the analyses of the differences in P 
centages in the following tables red” 


H " 15 
Statistical treatments other than what | 
involved in usin 


g the table in the Apper ^ 
(correlation, matched comparisons, wl 
tailed tests of Significance). This Appen ^ 
table is presented merely to provide E- 
proximate estimates of the statistical sig" 
cance of obtained differences. j- 
In all of the tables, 5$ have been wd 
fied as "High" or “Low” in motiva 
Score by dividing the distribution of C? 
tected T scores as near to the median ^ 
the distribution for the whole sample ^. 
was possible The tables indicate W, 7 
Percentage of particular subgroups are а, 
(above the median) in the national samp 6 
“ect cross comparisons between men ай 


t 
Women are not justified since different a 
forms Were employed for the two sexes and 
the distributions 


a 

of scores for p sif 
women Were considered separately in d€ 
nating high and ] 


OW scores. 
Education 


€ 


. ‚ ва № 
in an earlier section we examined г 
correction for differences in length of P 
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tocol as it affected the relationship between 
level of education attained by the Ss and 
mean motivation scores. In Table 15 the 
same result is presented in terms of the 
percentage of scores in each of three educa- 
tional levels that are above the median for 
the whole sample for each motive. The per- 
centage of college educated Ss, whether 
men or women, with high n Achievement 
scores is substantially higher than the per- 
centage of Ss who have had only a grade 
school education. Among women, the same 
result appears in connection with affiliation 
Scores and power scores. 

Now how would one interpret these dif- 
ferences? Are the picture cues biased in 
favor of the college population? Has the 
life experience of a college education 
affected the achievement motive of college 
women? Or were college women high in 
achievement motivation from an early age, 
and thereby managed to get to college? We 
cannot resolve these different views. Later 


TABLE 15 


EDUCATIONAL LEVEL RELATED ТО N ACHIEVEMENT, 
м AFFILIATION, AND N POWER 


Percentage of 
Motive* and High Scores* 
Education М 
Men Women 
oe Mr 
n Achievement 
Grade school 48 44 
High school 49 49 
College 62 52 
n Affiliation 
Grade school 47 43 
High school 47 53 
College 51 52 
n Power 
Grade school 55 42 
High school 47 51 
College 49 54 


in all subsequent tables, the motive measi 
=. Heart corrected for verbal fluency. семак 
* Does not include Ss whose educational level was not 
ascertained. Grade school Ns, ме = 176, Women = 222; 
high school №, Men — 271, Women = 422; college Ns, 
en = 146, Women = 127. е 
» Here and in all subsequent tables, high scores refer to the 


Scores on each motive measure that are above the median for 


the total sample of men or women. 


evidence, however, may bear on which in- 
terpretation is more appropriate. 


Occupation 


In Table 16 are presented the percentage 
of men presently employed on a full-time 
basis in various occupations who obtained 
high scores for each of the three motives. 
Achievement motivation scores are much 
more frequently high among persons in 
higher status occupations than among men 
in lower status occupations. 


No systematic relationship between n 
Affiliation or n Power scores and occupa- 
tional status is clearly apparent, though 
certain possibly meaningful differences in 
the different occupations may be discerned. 
Whereas 5196 of managers and proprietors 
score high in power motivation, only 43% 
of professionals and 40% of clerical work- 
ers, the remainder of the white collar occu- 
pation category, score high in power moti- 
vation. Should the difference here be con- 
sidered a manifestation of differences in 
basic personality disposition which has led 
one group of men to positions of super- 
vision and another not, or should the differ- 
ence in motivation scores be considered a 
reflection of the difference in motivational 


TABLE 16 


OccuPATIONAL LEVEL RELATED TO N ACHIEVEMENT, 
N AFFILIATION, AND N POWER 
(MEN EMPLOYED FULL-TIME ONLY) 


Percentage of 
Occupation of High Scores 
Respondents N |) = 
n Ach | n Aff | n Pow 
Professionals 67 60 54 e 
Managers and 
proprietors 70 59 56 A 
Clerical workers 30 51 47 47 
Sales workers 34 59 4T 50 
Skilled workers 120 | 80 | A59 
Semiskilled workers 88 52 41 50 
Unskilled workers 38 45 37 51 
Farmers 39 44 31 


" is not 
* Does not include Ss whose occupational ae dto these 
ascertained or Ss whose occupations were not СО 

categories, 
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influences contained in the day-to-day life 
situation of managers, as a group, versus 
those classified as professionals? Or, could 
the result conceivably be attributed to some 
biasing in the content of the picture cues 
favoring one group and not another with 
respect to the suggestion of power-related 
imagery? Again, the matter of interpreta- 
tion must await the extensive analysis of 
these and related findings that will be the 
substance of future papers based on this 
investigation. 


Income 


Table 17 presents the relationships be- 
tween motivation scores and family income. 
The irregular relationship between income 
and n Achievement scores in men clearly 
Suggests that other demographic factors 


TABLE 17 


FAMILY Income RELATED TO 


N ACHIEVEMENT, 
N AFFILIATION, AND 


N POWER 


Percentage of 
High Scores 


—— | Ne 


Men 


Motive and 
Family Income № 


Women 


n Achievement 


Under $1,999 66 | 42 42 |122 

$2,000-3,999 114 | 54 42 | 187 

ү ео) 100 | 47 48 |125 
,000—6,99 158 | 56 

$7,000-9,999 и |1 


$10,000 and above 60 


57 52 52 
n Affiliation 
Under $1,999 42 47 
$2,000-3,999 54 47 
$4,000-4,999 41 51 
$5,000-6,999 49 55 
$7,000-9,999 52 50 
$10,000 and above 48 52 
n Power 
Under $1,999 58 43 
$2,000-3,999 49 | 45 
$4,000-4,999 a7 | а 
$5,000-6,999 48 56 
$7,000-9,999 47 57 
$10,000 and above 55 40 


^ Does not include Ss 


whose family income 
В was 
certained. iin 


must be controlled in an effort to clarify E 
result. However, oddly, in the case о 
women, there is a positive relationship е 
tween achievement motivation scores an 
family income. 


Аде 


Perhaps the clearest illustration of К 
problem of interpretation gf sore n 
effects of personality disposition vers : 
situation effects is apparent in ешын 
of age trends, particularly in the p 
ment motivation scores of men ( Table 18). 

There are some notable differences v 
tween the age groups in achievement К 
vation (for both men and women). s 
example, the highest achievement € 
tion is found among the youngest men, = 
the next highest among а middle-ag' 


TABLE 18 


х 
eILIATION: 
AcE RELATED TO N ACHIEVEMENT, N AFFILIé 
AND N PowER 


Percentage of 
High Scores № 
Motive and Age | № | 
Men | Women| — 
n Achievement 53 
21-24 32| 66 47 | 905 
25-34 165 | 50 50 |184 
35-44 119 | 57 50 |152 
45-54 123 | 54 45 98 
55-64 82 | 42 52 71 
65 plus 16 47 891. ue 
n Affiliation 
21-24 47 53 
25-34 48 53 
35-44 44 55 
45-54 49 53 
55-64 53 36 
65 plus 49 38 | ص‎ 
n Power 
21-24 44 55 
25-34 45 52 
35-44 52 47 
45-54 51 49 
55-64 49 45 
65 plus 54 " 


ч ined’ 
* Does not include Ss whose age was not ascertal 
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group (35-44 years), while the lowest 
achievement motivation scores appear in the 
older groups. A number of alternative in- 
terpretations of these differences can be 
offered. On the one hand, these differences 
may reflect some important generational 
differences in the strength of achievement 
motive dispositions. On the other hand, the 
stage of a man's life cycle may have a great 
deal to do with his current aroused achieve- 
ment motivation—the eagerness of youth 
Starting out in a career and the middle-aged 
Dressure to "achieve or else" may both lead 
to heightened achievement motivation in 
contrast to the pressures of old age that 
may detract from achievement arousal. 
From other results we have obtained thus 
far, relating achievement motivation to 
other variables (controlling for age), we do 
find that the age groups vary considerably 
in what achievement. motivation signifies. 
One can at the present only speculate about 
a life cycle or a generation difference influ- 
encing these results, and perhaps both sets 
of factors are crucial. 

The age differences obtained may reflect 
Picture biases as well; the possibility of 
Systematic age differences in the connota- 
tion of the pictures cannot be excluded. 
Very old people, for example, might have 
Special difficulty in generalizing from the 
Pictures presented to their own life situa- 
tions since the figures in the pictures are 
Not clearly old in appearance. » 

As a result of these obtained age differ- 
ences in motivation scores and our specula- 
es about their meaning, we have x 
line СЕ as a critical variable for e: “a 
е re Of other substantive relationships 

Ported in future papers. 


Race 
On 
e Я Е densa 
suberna the most apparent instances where 
1р differences in motivation scores 


may reflec 


icture bias 15 
the compar pic 


the operation of i 
for white an 


Ne 150п of scores І 
tures pee sPondents (Table 19). The pic- 
Presented to all respondents tOr the 


mos 1 

What me, Carly portrayed white oct 
ы 15 Negro 

Person? It es does this have for a №65 


as been found that the $06107 
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economic position of the Negro can play as 
important a role in determination of some 
of his responses as does his race (Korchin, 
Mitchell, & Meltzoff, 1950) and that there 
is some tendency for Negroes to be less 
guarded in their responses to pictures of 
white persons (Cook, 1953). Other research 
has also explored this and related problems, 
but no unassailable conclusion has been 
drawn about the relative merits of using 
pictures of whites or Negroes with Negro 
respondents (Light, 1955; Riess, Schwartz, 
& Cottingham, 1950; Schwartz, Riess, & 
Cottingham, 1951). 

Comparisons between white and Negro 
respondents are also limited by the imme- 
diate test situation. Some Negroes were 
interviewed by white interviewers, although 
some of them (especially those in the 
South) were interviewed by Negro inter- 
viewers. Previous research dictates that we 
make note of this difference, although its 
implications are not entirely clear. Schwartz, 
et al. (1951) find that Negroes express 
more ideas to а TAT if the interviewer is 
white. 

Therefore, we do not have firm empirical 
ground for interpreting the meaning of the 


TABLE 19 


RACE RELATED TO N ACHIEVEMENT, N AFFILIATION, 
AND N POWER 


Percentage of 
High Scores* 
Motive and Race 
Men Women 
n Achievement - 
White a a 
Negro 
n Affiliation j 
White 5 E 
Negro 
n Power du 
White A г 
Negro 


ined or Ss 
i whose race was not ascertained 
ee not include Si able into these categories. White Ns, 
eg таса Women = 688; Negro Ns. Men = 31, Women 
Men = 538, 
60. 
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differences found in Table 19. At the 
present we can only note that white women 
seem to be higher in achievement motiva- 
tion scores, and Negro men are higher than 
white men in power motivation. The other 
comparisons show minimal differences. We 
can ask would we have obtained other dif- 
ferences had we used pictures of Negroes? 
Of course we cannot answer this question 
in the present study. Where we will use the 
Scores of Negroes, we will treat them as if 
there were no picture biases. Later results 
may lead us to reconsider the problem. 


Place of Residence 


In Table 20 we present the distributions 
of motivation scores according to the place 
of residence—whether in a metropolitan 
area or their suburbs, a small city, a small 
town, or a rural area. These findings 


TABLE 20 


CURRENT PLACE 


OF RESIDENCE RELATED TO 
N ACHIEVEMENT, 


N AFFILIATION, AND N PowER 


Percentage of 
Motive and High S 
Place of Residence | № аван N 
ا‎ 
Men | Women 
ا اا ڪڪ‎ | E. 
n Achievement 
etropolitan areas 86 | 48 45 
Suburbs | 87 | 52 42 E 
Small cities 92| 51 43 1 
Small towns 180 | 56 49 155 
Rural areas 225 | 53 49 230 
n Affiliation 
Metropolitan areas 4 
Suburbs а D 
Small cities 48 49 
Small towns 48 54 
Rural areas 51 46 
n Power 
Metropolitan areas 5. 
Suburbs 56 Е 
Small cities 46 46 
Small towns 44 51 
Rural areas 50 48 


^ Metropolitan areas and their suburb: 
United States Census Bureau Classification: sued td by the 
populations of over 50,000, small towns ро ulati: s ep have 
50,000, and rural areas are ons of under 


А оред farm со 
standard metropolitan area, untry not located in a 


present some differences in scores that are 
not completely independent of status con- 
siderations noted in other comparisons— 
income, occupation, or education differences. 
Nevertheless there are noteworthy results: 
For both men and women there is greater 
concentration of high achievement motiva- 
tion scores in small towns and rural areas 
than in larger communities. High affiliation 
motivation seems to be more prevalen 
among men from smaller communities, bu 
highest in women living in metropolitan 
areas. Power motivation seems to be rela- 
tively high in metropolitan areas. 

These differences may have important 
implications for the thesis that the type к 
community one lives in can determine ү 
development of certain motivational inter 
ests. For example, one might want to say 
that a consequence of living in smaller com 
munities is the development of a heighten. 
interest in affiliation and yet there rema 
one possibility that such an interpretatio 
is in error. Motivation can play a large m 
in determining the kind of commun! 
people live in, whether they are willing oe 
remain in a city or in a rural area, wo 
they migrate to suburban areas, an EN 
like. For example, men with high абай те 
motivation may prefer small towns whe | 
intimacy is easier. 


Broken Home Background and 
Death of a Spouse 


In the foregoing sections we have a 
Sested that many of the differences іп a 
vation scores obtained can be as plausib У 
interpreted as Situational differences as Me f 
ren personality differences, The P 
Tectness" of either interpretation, ОГ a 
relative importance of each interpretato, 
Should be resolved in further study. wo 
1$ evidence that the resolution of this prone 
lem will not be a general one; one ОГ tiy 
other interpretation will not be uniforms. 
арргоргіаќе, but both will be involved. ee 
evidence comes from examining two ee 
Special groupings—broken home b? 
ground, and death of a spouse. ar 
All respondents were asked whethe? its 
not they had lived with both of their ра" 
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before the age of 16. Through further 
probing we were able to group respondents 
into three categories: 

1. Coming from intact homes. 

2. One or both parents died before the respond- 
ent was 16. 

3. Parents were divorced or separated before 
respondent was 16. 
These experiences in early childhood are 
likely to be profound catalysts for inducing 
basic personality patterns. We feel that 
differences found in adults with these three 
kinds of early backgrounds might reflect 
early personality integration of motivation. 
In these comparisons (Table 21), parental 
divorce seems to have had effect on achieve- 
ment motivation—lowering it for males and 
raising it for females. Death in the family 
lowers affiliation motivation in males, but 
not in females, and either death or divorce 
raises power motivation in males and lowers 
it in females. We have found a sizable 
relationship between the age of the respond- 
ent and experiences of family disruption by 
divorce. Proportionally more younger than 


TABLE 21 


BROKEN Номе BACKGROUND RELATED TO 
N ACHIEVEMENT, N AFFILIATION, AND N POWER 


Percentage of 
Motive and High Scores* 
Home Background 
Men | Women 
n Achievement 
Intact home 54 48 
Parent(s) died 45 42 
Parents divorced or separated 38 60 
n Affiliation 
Intact home 49 49 
Parent(s) died 39 52 
Parents divorced or separated 46 51 
n Power 
Intact home 47 51 
Parent(s) died 61 ES 
Parents divorced or separated 62 44 


include Ss whose home background was not 

3 Deed E 58 whose home background was not codable 

ee categories. Intact home Ns, Men — 471, Women — 

E Parent(s) died Ns, Men = 80, Women = 125; Parents 
divorced Ns, Men = 24, Women = 45. 


older respondents make up the group from 
divorce backgrounds. Even with age con- 
trolled, however, most of the differences 
apparent in Table 21 remain; for the most 
part, the differences cited occur at each age 
group considered. 

Explanations of these differences would 
rest, we argue, on considering what impli- 
cations these early life experiences have on 
motive development. For example, we 
might suggest that divorce of one's parents 
can have a lasting effect on a child's 
achievement motivation because the disrup- 
tion creates significant early experiences 
and attitudes towards the competence of 
both the father and the mother. Usually 
children remain with the mother when there 
is a divorce. Resentment or criticism of the 
father must be inevitably transmitted to the 
child. This resentment added to the mere 
absence of an intimate contact with the 
father can have opposite effects on the 
achievement motivation of boys and girls. 
A boy, having lost his masculine model for 
achievement, may become highly involved 
in avoiding failure. In so doing, his 
achievement motivation, his positive motiva- 
tions for success, become weakened. On the 
other hand, girls living with a divorced 
mother have a readily available model for 
achievement identification. Resentment of 
the father can reinforce a need for feminine 
independence and self-reliance in a mascu- 
line world. The fact that her mother is 
apparently self-sufficient further enhances 
an image of the achievement orientation of 
women. This is overgeneralized speculation 
about the dynamics of divorce and achieve- 
ment motivation. Nevertheless, it is ап 
example of how we would treat теа | 
in motives found in family backgroun 
comparisons. Having obtained differences 
in motivation scores for these groups We 
can proceed with an interpretation of dif- 
ferences by assuming that the differences 
reflect enduring personality changes brought 
on by family disruptions in early childhood. 
What is perhaps most important 15 that 
these potential differences in motivation are 
attributable to dispositional differences. 
Therefore, the motivation score ca reflect 
personality differences. 
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The last table we are going to consider is 
one which compares two groups of men and 
women—those presently married and those 
whose spouses have died (Table 22). We 
present this table only for older (50 and 
above) men and women because the number 
of widows and widowers under 50 is rela- 
tively low. Differences without this age 
control might be attributable to age rather 
than the loss of a spouse. There are inter- 
esting differences in the table. For all mo- 
tives, except power motivation in men, the 
loss of a spouse yields a decrease in score 
which is primarily a situational change. 
The general conclusion might be that losing 
one’s wife or husband in later life for most 
people has the effect of decreasing their 
motivational concerns—a conclusion not at 
all out of line with current thinking about 
problems in old age. At any rate we have 
evidence that a life situational factor in 


later life can have a strong effect on moti- 
vation scores. 


SUMMARY AND CONCLUSIONS 


We have reviewed some of the methodo- 
logical and interpretive problems attending 
the use of thematic apperceptive measures 
of motivation in a nationwide interview 
study. In light of this exploration, it is 
possible to offer both evaluations of the 
difficulties encountered and a prognosis con- 


cerning these measures in substantive те- 
search within the survey setting. 
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Selection of Pictures. The selection of 
separate sets of six pictures for men and 
for women was guided by previous research 
concerning the effect of picture cues on 
thematic apperception. The percentage of 
stories elicited by these pictures which con- 
tained imagery related to achievement, affili- 
ation, and power was lower than anticipated 
from pretests but sufficient, we believe, to 
allow for valid assessment of individual and 
group differences in the strength of the 
three motives. 

We have no independent basis for evalu- 
ating the extent to which the objective was 
attained of providing a fair opportunity for 
persons in all strata of American society to 
express their motives. Only intensive anal- 
ysis of the responses to particular pictures 
by particular segments of the population 
(reserved for a later paper) will provide 
the kind of evidence needed for further 
evaluation of the adequacy of the pictures 
that were employed and clarification of the 


issues to be faced in subsequent attempts 0 
this sort. 


Interviewer Effects. In light of the 
known sensitivity of thematic apperception 
to conditions of administration, we antici- 
pated and found that despite our efforts to 
standardize the interviewing procedure, the 
interviewers did (apparently) contribute 
significantly to the variance of some (but 
not all) of the motivation scores. However; 
we were gratified to find that our estimates 


TABLE 22 


MARITAL STATUS RELATED TO N ACHIEVEMENT, N 
(Age 50 and Above Onl 


AFFILIATION, AND м PowER 
y) 


Percentage of High Scores 


Motive Men Women 
i Widowed Married Widowed 
(N27) (52122) (№=93) 
n Achievement 4T 37 
n Affiliation 50 41 E = 
n Power 48 63 a 13 
43 


سے 
 _‏ —— 
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of interviewer effect—maximal estimates as 
a consequence of possible confounding of 
interviewer and respondent effects in survey 
data—were no higher than in fact they 
were. That interviewer effects on motiva- 
tion scores were less than anticipated can 
be attributed to the correction employed to 
remove the relationship between raw moti- 
vation scores and length of protocol. Inter- 
viewers differed considerably in the average 
length of the imaginative protocols they 
Obtained. Correction of motivation scores 
for this factor effectively eliminated much 
of an otherwise substantial interviewer 
effect. 


" As a consequence of bias attributable to 
interviewers still remaining in some of the 
Motivation scores, We anticipate some loss 
in the sensitivity of the measures (1.е., an 
increase in error variance) in later analyses 
of substantive relationships, €.g. relation- 
Ships of motivation to demographic vari- 
ables. 


Reliability of Coding. The reliability of 
coding attained by a team of nine relatively 
novice student coders, while less than that 
normally attained with this method of con- 
tent analysis in small-scale experimental 
studies of college students, was nevertheless 
encouragingly high. The range of coding 
reliability coefficients for the motivation 
scores assigned to individuals was .72 to 91 
With a median of 77. All of these estimates 
of coding reliability are within the range of 
reliabilities reported in earlier published 
studies. However, in comparison with the 
median standard of 89 attained in experi- 
mental studies, in this study with new pic- 
tures and other novel problems there is a 
loss in reliability of coding and hence some 
increase in the error of measurement. We 
expect, therefore, an additional loss in the 
sensitivity of the measures in later analyses 
substantive relationships. 


Inadequate Protocols. A 
mber of individuals produced protocols 
dged inadequate for further 
analysis. The over-all loss of data amounted 
to 17% among men and 14% among 
women. In both sexes, the loss was heavy 


and serious, a5 might have been anticipated, 


of 
substantial 


nu р 
that were ju 


only among the least educated groups. 
Hence, the average motivation scores of 
some subgroups in the population, specific- 
ally those in which level of education is 
very low, must be considered biased esti- 
mates—derived only from the most verbal 
members of those groups. 


Verbal Fluency. One problem of consid- 
erable importance, the relationship between 
motivation scores and length of imaginative 
protocol, which is described by correlations 
of 20 to 28 in this national sample, seems 
to have been quite adequately overcome. 
The correction devised reduces these corre- 
lations to the range from —.04 to .05 and 
effectively removes the possibility that sub- 
stantive relationships, like the significant 
positive relationship between n Achieve- 
ment and education of respondent, can be 
considered spurious because of a depend- 
ency of motivation scores оп verbal fluency. 
As mentioned earlier, this correction also 
produced a very substantial reduction in the 
extent of variability of motivation scores 
attributable to interviewer differences. 


Interpretation of Motivation Scores. Dif- 
ferences in motivation Scores in various 
subgroups of the population can be vari- 
ously interpreted as reflecting differences 
in: personality, life situation, and the sig- 
nificance of the pictures for various groups. 
The relative importance of each of these 
interpretations. may be evaluated only by 
further inquiry into the kinds of variables 
related to these scores within different sub- 
groups. А particular network of interrela- 
tionships may be suggestive of which of 
these views may be supported. It might 
very well be that the interpretation of the 
scores will vary depending on which social 
group is being investigated. Or, one conclu- 
sion that scems likely is that for each social 
group the motivation score can be partially 
interpreted as personality assessment and 
partially as an assessment of reactions to 
ongoing life situations. There is evidence in 
the survey data that each of these interpre- 
tations may be operating in the scores. Mo- 
tivation scores reflect differences both in 


early life experiences (broken home back- 


30 J. VEROFF, J. W. ATKINSON, S. С. FELD, ann G. GURIN 


ground) and later life experiences (death 
of a spouse). 

These alternative considerations of the 
conceptual meaning of the motivation scores 
precludes the consideration of their relation- 
ships with demographic variables as unequi- 
vocal norms describing subgroup differ- 
ences in personality. The results have been 
presented here with the hope that they can 
Serve as background for exploring the sig- 
nificance of motivation differences in 
society. 

If methodological precautions like those 
outlined here are seriously considered, the 
use of thematic apperceptive measures of 
motivation (and other sensitive techniques 
of personality assessment) can be expected 
to make marked contributions in the study 


of important substantive problems in survey 
research. Careful inquiries into the rela- 
tionship of these measures to other psycho- 
logical variables and to demographic indices 
in this survey, and in subsequent national 
studies, should greatly enhance our under- 
standing of the social origins and conse- 
quences of motivation. At the same time, 
survey studies will provide insights that are 
needed to refine the thematic apperceptivé 
method itself. The use of thematic apper- 
ception to assess motivation in survey 
studies of national character will mean that 
factual evidence concerning personality 
configurations within American society can 
be integrated with the fund of factual А 
Чепсе concerning the dynamics of mona 
tion already derived from experimental an 

clinical use of these same measures. 
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WATER REGULATION IN THE RAT: 


II. THE ARTIFICIAL CONTROL OF THIRST WITH 
STOMACH LOADS OF WATER AND SODIUM. CHLORIDE: 


LAWRENCE I. O'KELLY 
University 


2 er the study of motivational vari- 
ables in relation to other behavioral 
phenomena has become а matter of increas- 
ing interest, the actual number of ways in 
which motivation can be controlled is indeed 
small. In animal experiments under hunger 
ог thirst motivation, procedures have been 
limited almost exclusively to variations in 
the techniques of deprivation. A brief de- 
Scription of some of the variants of the 
natural" procedure will serve to clarify the 
need for additional methods of controlling 
motivational variables. The purpose of this 
monograph is (a) to describe a procedure 
for "artificially" manipulating the thirst 
drive of rats by the placement of stomach 
loads of varying volume and composition, 
and (b) to present a series of standardiz- 
ing, parametric studies of some of the prin- 
ciple variables involved. 


THIRST CONTROL 


Natural Methods 


Subjects maintained оп deprivation 
schedules differing in duration. The short- 
comings of this procedure are several. 
Under differing hours of water deprivation 
the Ss have different degrees of associated 
hunger, it having been shown that food in- 
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take is less when water is absent from the 
animal's living quarters. Thus, thirst and 
hunger effects are confounded. Further, if, 
having maintained Ss on a particular depri- 
vation schedule, one wishes to shift them 
abruptly to a different schedule, the Ss may 
turn out to be somewhat less flexible than 
the experimenter. The animal adapts to 
particular maintenance schedules only grad- 
ually, and changes in behavior following 
schedule disruption cannot be attributed 
solely to changes in “drive strength,” for 
example. Differences in duration of depri- 
vation often must be quite large in order to 
produce behavioral differences of measur- 
able amount. This means that Ss may have 
to be run at irregular times which may fall 
in different parts of the same or different 
diurnal cycles, a factor which is usually re- 
flected in greatly increased performance 
variability. Another serious hazard of de- 
privational methods arises when they are 
used for producing drives of high intensity. 
Long deprivation periods (longer than 36 
hours of water lack) inevitably result in 
debilitation and inanition when continued 
long enough to be of any experimental 
value, thus introducing another confound- 
ing factor. Гог example, in situations 
where performance declines from maxima 
achieved at intermediate deprivational 
levels, it has always been difficult to deter- 
mine whether drive increases continuously 
with hours of deprivation. 

Predrinking. Sometimes all Ss are main- 
tained on a common schedule, but are al- 
lowed to drink different ‘amounts of water 
prior to testing or training. This, however, 
has the disadvantage that differential pre- 
drinking itself is a variable affecting per- 


to 


formance (Bruce, 1937, 1938; Miller, 
Sampliner, & Woodrow, 1957). Moreover, 
if an investigator wishes one of several 
groups to be maintained on a long or severe 
deprivation schedule, then all Ss must be 
maintained on that schedule, with the conse- 
quent health risks as noted above. 

Running at varying times after drinking. 
А variation of the above procedure is to 
maintain all Ss on the same Schedule, but 
гип them at varying intervals after drink- 
ing. This has some advantage over a pre- 
drinking method, but still does not solve the 
differential hunger problem since Ss typi- 
cally eat just after they have drunk. One 
can readily shift hours of deprivation sim- 
ply by running the Ss at different times 
after drinking; but if the Ss have learned 
to perform at a certain time it is difficult to 
know whether to interpret the change in 
due to differences in 


Maintenance at 


у а percentage of bod 
weight. This depriv rn d 


‹ ation is commonly used 
With both rats and Pigeons for the control 
of hunger, but does not appear to be well- 
suited to control of thirst. Whereas а 10- 
20% Weight reduction with food depriva- 
tion makes а good active animal for much 
Tesearch, such a Water loss could not be 
tolerated by the rat, Most important per- 
haps, 15 the fact that the effects ‘of a 
Weight-controlling Procedure would inevita- 
ly be confounded with the effects of the 


t appropriately be attrib- 
mated, 


«Irtificially Controlled Th irst 


Because of the many 
inconveniences attached t 
solely through duration 
tion, we have expended 
on the study of alternati 
plishing the same ends, 
of very clear 
methods. All < 
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any differences in the motivational states 
they are under during the actual experi- 
mental observations. A variety of intensities 
of thirst can even be induced in the same 
animal on different occasions without 
change of the daily maintenance senean 
An earlier study (O'Kelly, 1954) showed 
that the effects of water loads are inde- 
pendent of each other if there is a two-day 
recovery period between loads; in the ex- 
periments to be reported in this paper, thé 
same is true for a variety of NaCl logas, 
although the length of the recovery регіо 
is slightly longer. ЖУР 
Another important advantage of artificial 
control is that variations in thirst can к. 
obtained and sustained independently © 
variations in consummatory activity, thos 
permitting the study of relationships be- 
tween various parameters of tissue need amo 
behavior without the complications intro 
duced by behavioral consequences of ү 
consummatory activity itself. Whether 
Physiological imbalances vary concom" 
tantly with drive is still an important р 
answered question ( O'Kelly & Heyer, 195 ^ 
Towbin, 1949). Wayner and Reiman 
(1958) found that running speed increas 
as a function of time after hypertonic jas 
injections, but that the amount of wate? 
consumed remained constant. Experiments 
of this type illustrate the advantages of be 
ing able to influence separately those vat 
ables of tissue disequilibrium from [ee 
temporal and spatial supports with whic 
they are inevitably associated in natur? 
deprivation, 


Methods of Control 


Tt is well known that drinking can 
stimulated artificially by introducing inte 
the animal's blood stream a number of solt- 
tions of Concentration hypertonic to that 0° 
the plasma Gie., having an effective osmotic 
С than that of the plasma): 
* has been the substance ai 
`6. as the normal major extr?" 
lyte, it produces the best com 

drinking enhancement W! 


- he 
еса toxic or other side effects. by 
Solution тау be delivered to the plasma ^ 


-—— a2 
—————À 
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a variety of routes: intravenously, intra- 
peritoneally, or subcutaneously by injection ; 
directly into the hypothalamus or third 
ventricle by stereotactically placed hypo- 
dermic needles; or, finally, introduced into 
the digestive tract by fistula or esophageal 
tubing. Since the present monograph is 
concerned only with use of the stomach 
loading approach, we will limit our com- 
ments on the other techniques to those re- 
sults that have stemmed from our own 
previous trials of the subcutaneous injec- 
tion of saline. 

Subcutaneous injection. Proceeding from 
the findings of Gilman (1937) and of 
Holmes and Gregersen (1950) that intra- 
venous injections of hypertonic sodium 
chloride solution increased water intake, 
Heyer (1951) showed that similar enhance- 
ment of drinking in the rat could be induced 
by subcutaneous injections of 15% NaCl 
solutions. Further, by varying the amount 
of NaCl injected, water intakes equivalent 
to those observed after 12, 24, or 36 hours 
of water deprivation could be secured. 
Further standardization of the injection 
technique was done by Wayner (1953) 
and Wayner and Reimanis (1958). While 
drinking can be potentiated by this means, 
certain disadvantages are characteristic of 
the procedure. The most serious is the skin 
damage and pain to animals that may ac- 
company too long a series of repeated injec- 
tions. This limits the number of days over 
which unconfounded observations may be 
made. Also, by the nature of the treatment, 
it is practicable only to enhance drinking 
from any base reference level, whereas an 
ideal technique of drive manipulation should 
permit controlled variation from satiation 
on up (and possibly down, in the direction 
of water excess). While drinking potentia- 
tion is a function of the total amount of 
salt, rather than its concentration, drinking 
inhibition is a function of, volume; sub- 
cutaneous injections of volumes of sufficient 
size to limit materially the subsequent water 
intake would be impracticable to administer. 

Stomach loading. For use with rats or 
other animals of a similar size, it would 
appear that loading of fluids directly into 
the stomach via esophageal tubing is the 


better method. The rat’s stomach will ac- 
commodate relatively large volumes, thus 
giving the experimenter a greater latitude 
of choice in both volume and concentration 
of administered fluid. Since the rat has no 
regurgitatory reflex, fluid is not lost during 
or after placement, as is often the case with 
hypodermic injection. Also, quite impor- 
tantly, the route through stomach and in- 
testine is the usual path taken by ingested 
fluids, and the various steps along the way 
from stomach to ultimate arrival in the 
extracellular fluids will more nearly involve 
normal physiological processes and normal 
latencies of action. 

Administration of fluids in this manner 
appears to be relatively painless, and it is a 
matter of usual observation that animals do 
not develop avoidance habits in relation to 
the experimenter or the general laboratory 
situation. As the technique is used in our 
laboratory, the rats are lightly etherized be- 
fore loading. This reduces struggling, and 
the procedure can easily be accomplished 
with a minimum of difficulty by one person. 
The time for etherization and loading nor- 
mally range from 1.5 to 2 minutes. General 
recovery from the anesthetic is rapid, the 
animals appearing grossly normal within 5 
minutes. There is a small but reliable de- 
pression of drinking as a result of exposure 
to ether (O'Kelly & Weiss, 1955), but this 
does not appear to interact in any specific 
manner with load effects. 

Previous studies using the stomach load- 
ing technique have demonstrated that (a) 
water consumption by 23.5-hour water- 
deprived rats can be depressed in quantita- 
tive fashion by varying volumes of water 
loads, or can be enhanced by hypertonic 
NaCl loads (O'Kelly, 1954); (b) bar 
pressing may be similarly enhanced or de- 
pressed by such loads (О’КеПу & Falk, 
1958); and (c) runway performance 1$ 
affected in a like manner (Solarz, 1958). 
Thus, the technique is capable of producing 
behavioral effects consistent with the ma- 
nipulated need state. Finally, water regula- 
tion has been studied in some detail by 
means of this and related procedures so 
that such physiological variables as rate of 
clearance of fluids from the stomach and 
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their intestinal absorption following loading 
are partially understood (O'Kelly & Falk, 
1958). 

The studies presented in this monograph 
are a continuation of the program cited 
above. They constitute a further extension 
of our efforts to quantify the control of 
thirst in the rat by means of stomach load- 
ing technique, and to study relations be- 
tween the need state of tissue dehydration 
and resulting behavior. 


OUTLINE oF GENERAL PROCEDURE 


With the exception of various details of 
management of experimental variables, the 
procedure used in the drinking studies (Ex- 
periments 1-4) was the same for all, 
can be outlined first. 


Subjects. The Ss in Experiments 1-4 were ex- 
perimentally naive male albino rats of the Holtz- 
man strain, from 100 to 130 d 
Start of the experiments, 
ratory chow 


and 


nt was started they did not re- 
€ home cages until it had been 
completed, 

Adaptation and recovery 
experimental treatment the 
ir. water deprivation sched 
day they were brought fr 
the laboratory where they w 
given access to water for a 
individual drinking boxes, 
Were weighed at 
special cages, 
days, and for 


the regular time, but drank in 
described below, 


Apparatus. The appar: 


| atus consisted of drinking 
boxes, metabolism cage 


Й $, and water tubes. The 
drinking boxes, 4 in, by 11 in. by 6 in. high were 


constructed of wood and had hinged wire tops. 
The metabolism cages, circular in shape and made 
of hardware cloth, were 6 in. high, 8 in. in diam- 
eter, and rested on double thicknesses of fine 
copper wire screen set in the mouths of glass 
funnels. Underneath each funnel was а 50-cc. 
centrifuge tube for the collection of urine, the 
feces being screened out. Funnels and screens 
were cleaned before each session to insure uncon- 
taminated urine. The drinking tubes were 100-сс. 
gas-measuring tubes, graduated to 0.2 сс. into 
which were placed drawn glass drinking spouts. 


о 
E 


The ends of the drinking tubes were eee a. 
the drinking boxes and metabolism cages abo 
1 in. above floor level. 


Solutions and drinking water. All palin. ВЫШЕ 
tions were made with distilled water, wine A 
weight/volume ratio (Pfaffman, Young, Det = 
Richter, & Stellar, 1954). All loads were givn 
room temperature. All drinking water wag «o 
tap water, given at room temperature. po ат 
concentration of the load solutions was сһес 
by chemical analysis. | 

Load day procedure. Оп load days аан 
were weighed at the regular drinking time a 
the exact volume of load was a кы n 
animal Each 5 was then pe dise nd 
lacing it in a muscum jar satu ‹ 1 
fomes and given its xe by posed mie 
atheter), as has been previo! ded А 
Okai 1954). It was then put ng its peat 
lism cage while it recovered from the e 7 
15-min. delay period was used in all р LR 
experiments except 4, which was speci qoe 
study of the effects of delay of access to i. үт 
after loading. At the end of the delay perio de 
water tube was inserted into the cage and RS 
intake and urine output were recorded at M 
intervals for 30 min, 15-min. intervals tor edi 
next 90 min., and at 30-min. intervals for the in 
120 min., a total of 4 hr. At the end of 2 hr. i: 
centrifuge tubes collecting the urine were chang ы 
so that separate urinalyses could be made on 
first and last halves of the drinking period. 

Urinalyses. Immediately following their ee 
tion, all urine samples were put into screw 
vials and refrigerated until they could be ап 
lvzed. In the first two experiments analyses kp 
made for chloride content, using the method о. 
Schales and Schales (1941). In the later drinki т 
experiments sodium апа potassium content WE 


1 р ho- 
determined as well, using a Beckman flame P 
tometer. 


EXPERIMENTS | AND 2: Tug Errects 0х 
WATER AND SODIUM CHLORIDE Loans OF 
THIRSTY AND SATIATED RATS? 


Method 


Subjects and ex 
Sixty rats were 


on the 4-hr, 


PR. C. Beck and L. I. O'Kelly, 
Sponsibility for design and execution 
Parts of this series varies 


Periment, authorship will 
fashion. 


Since the 22 

of the vario 

from experiment to His 
be indicated in t 
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TABLE 1 


VOLUMES, CONCENTRATIONS, AND ABSOLUTE NaCL 
CONTENT OF LOAD SOLUTIONS 


Volume 
% of NaCl Concentration 
Body (grams/litre) 
Weight 
1 o 3.0 5.8 9.1 12.0 15.3 18.5 
2 0 3.0 4.5 5.8 7.4 9.1 
3 0 20 3.0 3.9 4.9 5.8 
g 0 15 2.3 3.0 3.6 4.5 
5 0 1.2 1.7 24 30 3.6 
| Isosals 
(grams NaCl/kilogram body weight)” 
0.3 0.6 0.9 1.2 1.5 1.8 


Note.—All loads were used in Experiment 1. Italicized loads 
were used in Experiment 2. 

a “0” indicates tap water load. 

» Due to slight errors in mixing solutions the isosals are 
medians for the five different volumes of load. 


There were five different NaCl concentrations at 
each volume, The treatments are summarized in 
Table 1. 

Three characteristics of Table 1 should be 
pointed out. First, the concentrations at each vol- 
ume were selected so that the highest concentra- 
tion would approach a near-lethal dose and so 
that there would be equal steps between the con- 
centrations at a given volume. The differences 
depart slightly from equality due to slight vari- 
ability in mixing the solutions, A lethal dose was 
estimated as being an absolute amount of salt 
equal to about two grams per kilogram of body 
weight. This estimate was based on extrapolation 
from results in preliminary studies which had in- 
cluded a few actual lethal doses. 

Secondly, each of the five concentrations at à 
given volume represent the same absolute amount 
of salt as are found in the concentrations at any 
other load volume. Consequently, we have two 
ways of studying the effects of concentration of 
EA on drinking : (a) by varying the volume 
of load and holding the absolute amount of salt 
constant (reading down each of the columns in 
Table 1); or, (b) by holding the volume constant 
and varying the amount of salt (reading across 
each of the rows in Table 1). A given amount of 
NaCl, in varying volumes of solution, is referred 
to throughout this paper as an isosal. There are 
in this experiment five isosals, the tap water loads 
not being considered as such. 

Finally, the 3.096 concentration was included at 
the 196 volume $0 that this aS would 
appeat with all volumes. ir alone does not have 
an isosal counterpart at all volumes. 


The 60 5$ were divided into two groups of 30, 
making two simultaneously run replications of the 
experiment, one in the morning and the other in 
the afternoon. Each S received a different load 
on each of three separate days, with a four-day 
recovery period between loads. Treatments were 
randomly assigned to animals with the restrictions 
that (а) по 5 received the same load twice, and 
(b) each treatment occurred equally often on 
first, second, and third load days. There was a 
slight departure from the last restriction in that 
there were five, rather than six, observations for 
six of the 31 treatments. Because it was impos- 
sible to load and observe more than 10 animals in 
a half-day, the experiment was staggered, each 
treatment “day” being divided into three separate 
daily sessions, with 10 animals loaded in each 
session. 

Subjects and experimental design, Experiment 2. 
Whereas the first experiment used Ss 23.5-hr. 
water-deprived at the time of loading, this study 
dealt with the effects of such loads on "satiated" 
rats, Le, rats in a state of water balance at the 
time of loading. Since it was deemed desirable to 
maintain Ss on a drinking schedule, satiation was 
achieved on load days by giving them their regular 
30-min. drinking period. They were then removed 
to a common retaining cage and following а 1-Вг. 
delay, a time previously shown to be sufficient for 
water to clear the stomach and be absorbed from 
the intestine (O'Kelly, Falk, & Flint, 1958), were 
loaded and given their 4-hr. drinking period. 

Thirty rats were used, weighing an average of 
369 grams on the first load day, about 8 grams 
heavier than the Ss in Experiment 1. The only 
difference in design from Experiment 1 was that 
the hypertonic loads were given at only three 
volumes (1, 2, and 3% body weight) and no 
water loads were given. The volumes and con- 
centrations used are those in italics in Table 1. 
Since each animal was loaded three times, there 
was a total of 90 loads, 6 Ss for each of 15 


treatments. 


Results of Experiments 1 and 2 


In order to facilitate comparisons, the ге- 
sults of both experiments are presented 
simultaneously. Before doing so, however, 
it is necessary to comment on the statistics. 
Unless otherwise stated the analyses were 
three-way analyses of variance: load con- 
centration X load volume X day of loading 
(first, second, or third). Since the number 
of Ss having any particular sequence of 
treatments was too small to be analyzed, it 
was assumed that each S’s three different 
treatments were independent of each other, 


and that the number of independent obser- 
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vations was equal to the number of loads ferences between replications, they were 


given. In Experiment 1 only the treatments combined. All the treatments in Experi- 
including the five isosals common to all ment 2 were included in the analysis. Only 
volumes were used in the analyses; water significant F ratios are reported, these being 


loads were excluded. There being no dif- given at appropriate places in the text. 
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Water Intake 


Figure 1 summarizes the 30-, 60-, 120-, 
and 240-minute water intakes as a function 
of volume and concentration of load. For 
the deprived animals, in the early part of 
the drinking period (30-60 minutes) drink- 
ing increases in a near-linear fashion with 
load concentration up to some optimal point 
of concentration, and then declines. With 
increasing volumes the optima occur with 
Progressively smaller concentrations and in 
fact occur at each volume with approxi- 
mately the same amount of salt, between 
0.9 and 1.2 grams per kilogram of body 
weight. The exact optimal salt load is in- 
determinate from these data, but data from 
another experiment indicate that the opti- 
mal concentration for 2% volume is about 
5%, or 1 gram/kilogram of body weight. 

The form of the 30-minute intake curves 
differs between the deprived and satiated 
Ss. The 3% volume curve is similar in the 
deprived and the satiated groups, but the 
296 deviates at the 5.896 concentration, 
which is lower than the 4.5% when it is 


'expected to be higher. The 1% volume 


curve shows little patterning at 30 minutes. 
Individual drinking curves show that the 
reason for this discrepancy between the 
satiated and deprived groups is the ability 
of some of the satiated Ss to go for long 
periods without ingesting water. For ex- 
ample, one S given a 3% volume of 6% 
NaCl (the maximum volume and concentra- 
tion in Experiment 2) drank nothing in 
four hours. A number of other animals 
failed to drink in two or three hours. 

Total intake for four hours tends to be 
proportional to the amount of salt loaded 
in both experiments, except that the most 
concentrated loads at any volume produced 
an inversion in this amount of time. With 
drinking periods longer than four hours, 
presumably all would be monotonic func- 
tions. 

Separate analyses of variance were calcu- 
lated in each experiment for intake during 
the first two hours, second two hours, and 
for the total four hours. The results were 


essentially the same for both experiments, 


except that there was a significant day effect 
in Experiment 1, the total mean intakes 
being 43.7, 46.5, and 50.1 cc. on the succes- 
sive load days. (This effect is studied more 
exhaustively in Experiment 3.) The most 
potent influence was the amount of salt in 
the load, with volume playing a much 
smaller role, and with almost no interaction 
between the three variables. With increas- 
ing load volumes there was a corresponding 
decline in drinking, presumably due to the 
extra water included in the larger loads, a 
circumstance requiring the animals to take 
in a lesser fraction of their water by 
drinking. 

Further analyses were computed for the 
two experiments jointly, using the 1, 2, and 
3% volumes from Experiment 1. The de- 
privation effect was highly significant dur- 
ing the first two hours (Ё = 130.96, but not 
during the second two hours (F < 1.00). 
When the amount predrunk by the satiated 
animals was added to their first two-hour in- 
take, however, the F value for deprivation 
was reduced literally to zero. The average 
four-hour intake for the deprived animals 
at the 1, 2, and 396 volumes was 48.80 сс. ; 
that of the satiated animals, with predrink 
added, was 47.56 cc. It seems clear that 
the deprived animals drink first of all to 
offset their 23.5-hour deficit, then to coun- 
teract the salt loads. 

The initial high intake rate of the de- 
prived animals as compared to the satiated 
is demonstrated clearly in Figure 2, which 
shows the rate of intake over four hours as 
a function of isosals, with all load volumes 
combined. These curves, though somewhat 
stylized due to combining, still reflect accu- 
rately the individual volume curves. For 
the deprived animals, at all volumes, the 
initial depression of drinking due to high 
concentration loads is gradually made up 
throughout the four hours, whereas at the 
lower concentrations the animals take in 
their water quickly (30-60 minutes) and 
drink very little thereafter. The satiated 
animals drink at a slower rate in the first 
hour than do the deprived, but with the 
higher isosal loads they, too, drink steadily 
throughout the four hours. 
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Fic. 2. Four-hour drinking rates as a function 
of isosal in Experiments 1 (deprived) and 2 (sati- 
ated). (Time zero on the abscissa is when the ani- 


mals were given access to water after loading. For 
the Н.О 1 


loads and each isosal, the rates for all 


load volumes have been combined.) 
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Fic. 4. Four-hour urine rates for Experiments 1 
and 2 as а function of isosal. (The initial higher 
rate with the 1.5 gm/kgm isosal than with the 1.8 
appears to be due to the animals with the 1.5 load 
having a diarrheic load effect earlier than those 
with the 1.8 load. In Experiment 1 the anti- 
diuretic effect of low concentration loads is seen 
where the 0.6 isosal load is followed by lesser 
urination than the Н.О load.) 


hours. The predrink and hour's delay be- 
tween predrink and loading of the satiated 
animals evidently does not influence delay 
of onset of urination after loading, rate of 
output, or total urine flow. 


We find then, with respect to both water 
intake and urine volume that there are no 
differences in the deprived and satiated 
groups other than the extra water needed 
by the deprived animals to offset their pre- 
viously incurred 23.5-hour deficit. 

The upper part of Figure 5 shows the 
urinary chloride concentrations for both ex- 

eriments as a function of isosal. Since 
urines were collected separately for the first 
two hours and second two hours, this “half” 
variance was included in the analysis. Once 
again, the isosal effect is very large, and in 
Experiment 2 it is the only significant vart- 


ance. Ап unexpected occurrence was the 
highly significant difference between halves 
in Experiment 1, where the second-half 
urine is much more dilute than that from the 
first half. This effect did not occur in Ex- 
periment 2, however, and а combined anal- 
ysis of variance for the two experiments 
shows an F value of 24.86 between deprived 
and satiated in the second half. 


The animals have two urinary means of 
ridding themselves of the imposed NaCl 
load. They can excrete a larger volume of 
more dilute urine or a smaller volume of 
more concentrated urine. As Figure 5 
shows, the animals have a fairly constant 
increase in total urine volume, as a function 
of isosal, over the four-hour period. Hence 
this mechanism of chloride excretion (vol- 
ume) is being used throughout the entire 
range of isosals. The concentration of 
urine, on the other hand, increases only up 
to the second or third isosal, then remains 
at a level of about 2% (although individual 
animals were able to concentrate their urine 
as high as 39%). The lower part of Figure 
5 shows the product of concentration and 
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volume, the total amount of chloride ex- 
creted, in milligrams. Whereas the rats 
may have reached an upper concentration 
threshold in both halves of the drinking 
period, they apparently had not reached a 
volume limit, nor a limit for chloride excre- 
tion in the second half. Otherwise, there 
would have been some asymptotic limit ap- 
proached for chloride output in the second 
two hours, such as is indicated in the first 
two hours. Apparently, there could have 
been a still greater volume of urine pro- 
duced had there been more concentrated 
loads given. Load concentrations above 
those in this experiment run dangerously 
close to lethal dosages, however, so it may 
be that urine volume would increase right 
up to the final lethal dose. 

In relation to the usefulness of various 
loads for behavioral experiments, it may be 
noted that the occurrence of diarrhea fol- 
lowing loading is of some practical interest. 
Diarrhea occurred commonly at the two 
highest isosals in Experiment 1 (about 40% 
of the cases), much more frequently than 
in Experiment 2 probably due to the pres- 
епсе of more water within the intestinal 
lumen, The number of cases, by increasing 
isosals, was 0, 3, 6, 13, and 12 in Experi- 
ment 1 and 0, 0, 2, 4, and 1 in Experiment 
2. A test of the difference in Proportions at 
the two highest isosals was significant be- 
yond the .01 level, Using only the three 
lowest volume’ loads in Experiment 1 for 
this comparison, the significance level was 
between .05 and .10. In Experiment 1 there 
were about an equal number of diarrheas at 
all load volumes (range: 6-9), except at 
19% where there were only 2. In Experi- 
ment 2, the distribution was 2, 4, and 1, 
with increasing volume of load. 


Interload Day Weight and 
Drinking Changes 


Weight and drinkin, 
after loading are shown in Figure 6. Since 
there was no regular drinking period on 
load days in Experiment 1, the day before 
loading was taken as the baseline, The sati- 
ated animals have an extra recovery day 


g changes on days 


since the second and third load days are 
"recovery" days from the previous loads. | 
In both experiments the increase in 
weight following loading reflects the total 
intake on the previous load day. The weight 
changes increased with increasing isosals 
up to the fourth (1.5 gm/kgm), then de- 
clined sharply at the fifth. In Experiment 1 
there was an overall correlation of .51 be- 
tween load day intake and weight change 
on the day after. Water loads, however, 
had no effect on weight. In each experi- 
ment, separate analyses of variance were 
computed for weight and drinking changes 
on each post-load day until there were no 
Significant effects. T' tests were computed 
for the difference between baseline days and 
days after, using all observations, irrespec- 
tive of treatment (Н.О loads excluded). 
The main difference between the satiate 
and deprived animals is that the deprived 
were still significantly overweight at the еп 
of four days, while the satiated animals nes 
turned to their preload weights in this 
period of time. As expected, concentration? 
was the most important variable (a signi 
cant effect until the third post-load day) 
with some small influence of volume. The 
Sreater regularity of the curves for d€- 
Prived Ss in Figure 6 may simply be а 
matter of reliability since there were more 
Ss in the first experiment, but it more likely 


corresponds to the greater regularity of in- 
take on load days. 
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Discussion 


The first major result of the present ex- 
periment is that drinking, urine volume, and 
urine concentration are mainly a function 
of the amount of salt loaded. This agrees 
with the findings of Wagner and Reimanis 
(1958) for subcutaneous salt injections. 
Load volume effects, while significant, are 
comparatively slight and seem to be due to 
the fact that, since higher volume loads de- 
part less from isotonicity than small volume 
loads, they require the animals to drink less 
water to offset the experimentally-produced 
deficit. 

The second critical result is that salt loads 
can depress drinking, as well as potentiate 
it. Thus, whereas the lower concentration 
loads produced immediate drinking at a 
high rate (due partly, of course, to the 23.5- 
hour deficit), the higher concentration loads 
produce a lower but more persistent rate of 
drinking. This suggests (with supporting 
evidence from Experiment 6) that the 
higher concentration loads are not well- 
suited for use in behavioral experiments, at 
least with the 15-minute delay after loading. 
Direct observation of the stomachs and in- 
testines of animals loaded with high concen- 
tration solutions shows them to be very 
distended, filled with osmotically-induced 
water from the tissues. Overall activity 15 
considerably reduced. The occurrence of 
diarrhea serves as a check on this observa- 
tion. 

It would seem a good guess that the 
highest concentration load practical for the 
control of behavior woull be one at, or 
slightly below, the optimal load for potenti- 
ating drinking in a 30-minute test period 
us pere 1). The loads up to this point 
which F show an increment of drinking 
drops ox ptgsimately linear and drinking 

s rapidly after about an hour (Fig- 
ure 2). In other words, tl loads seem 
to produce an immediat. nese: logi = 
deficit which is сате change ш water 
А offset within a minimal 
time. 

The great similarity between the deprived 
and satiated animals is important with re- 
spect to the physiology of salt-load regula- 
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tion and to the management of behavioral 
experiments. The deprived Ss seem to show 
more regularity than the satiated, however, 
as shown in their more orderly drinking 
curves. While larger numbers of animals 
for the treatments under the two conditions 
might produce completely analogous drink- 
ing results, the lesser variability of the de- 
prived group with only a few animals per 
condition is just a point in its favor. One 

of the sources of greater variability for the 

satiated animals is the fact that they can 

manage the load either by drinking or by 

excreting. Some of the satiated animals 

seemed able to utilize body water very 

effectively for kidney function during the 

first part of the drinking period and took in 

their "environmental water" later in the 

period. The deprived animals, of course, do 

not have as much tissue water to expend in 

kidney regulation at the time they come 

into the test period. 


EXPERIMENT 3: THE EFFECTS OF 
REPEATED WATER AND МАСІ, Loans 
ON THIRSTY Rars* 


This experiment was designed to study 
the cumulative effects of six repeated load 
treatments, using a six-day intertreatment 
interval rather than the four-day interval of 
Experiments 1 and 2. For reasons just 
discussed, concentration values which were 
on the accelerated portion of the 30-minute 
intake curve (Figure 1) were selected. Of 
the five volumes in Experiment 1, the 2% 
body weight volume seemed to give the 
greatest amount of drinking. The concen- 
trations used in this volume did not appear 
to be such as to overly irritate the RES 
tract, and the loads were of small d 1 
volume to be readily tubed into the animals. 
The decision to study thirsty anima 
than satiated was guided by two ас 
tions: the greater regularity of results wi 1 
the deprived animals, and the possibility о 
either enhancing ог depressing the drin! ing 
from a 23.5-hour deprivation base line. 


SSS 
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Method 


Since the optimal concentration value for a 2% 
load appeared to be about 6% (in terms of maxi- 
mal potentiation of drinking), it was decided to 
use concentrations of 0 (tap water), 2.0, 4.0, and 
6.0% NaCl. Initially there were 10 animals in each 
group, but in the course of the experiment five 
were lost due to death or treatment error, so that 
35 finally completed the experiment. There were 
nine subjects in three of the groups and eight in 
the fourth; the weights of all Ss averaged 363 
grams on the first load day. АП were male Holtz- 
man albino rats about 100 days old. Following 
adaptation as previously described, each S was 
given the same load once a week over a period of 
Six weeks. Methods of collecting drinking and 


urine data were the Same as in the previous 
experiments. 


Results and Discussion 


Water Intake 


Figure 7 shows the rate of water intake 
for the four different load concentrations 
on the first load day. We see here that our 
choice of 6% NaCl аз the “optimal” load 
for potentiating drinking was incorrect, 
since these animals actually drank less than 
those loaded with 4% during the early part 
of the drinking period. As a check on what 
the “true” optimum might be, a different 
Sroup of 10 animals was given a 2% volume 
load of 5% NaCl. These Ss, in 30 minutes, 
drank a mean of 37.2 сс., which was above 
that Previously obtained with any load. 
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After four hours of drinking, however, in- 
take (57.9 cc.) was so far above the extra- 
polation from lower concentrations as to 
suggest that concentration was not the only 
variable operating (see Figure 21). It is 
clear, however, that the optimal load con- 
centration is closer to 5% than to 6% NaCl. 

The broken lines in Figure 7 show the 
rate of intake following H,O loading on the 
second through sixth load days. There are 
no large differences in the first hour or 30, 
but by two hours there is a clear separation 
of rates, and by two and a half hours intake 
varies as а function of load day, only the 
second day being out of order. The results 
with this particular load concentration are 
representative of those obtained with the 
other three concentrations. As Figure 5 
shows, total four-hour intake increased 
equally for all groups over load days. . 

There was virtually no overlap of indi- 
vidual S's intakes between groups on any 
given load day. Thus, almost all water- 
loaded animals drank less than 2% animals, 
who in turn ingested less than the 49 
group, and so on. The differential treat- 
ments thus were highly reliable, and there 
were no observable ill effects to the animals 
health occasioned by the repetition of loads- 
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The within-group reliability was high 
also, as indicated by rank-order correlations 
for total intake from one load day to the 
next, These correlations averaged between 
.50 and .70 for adjacent treatment days, 
and then declined toward zero with an in- 
creasing number of treatment days separat- 
ing the days correlated. Since the groups 
were small, and were taken from a very 
homogenous population, relatively minor 
Changes in intake would have a large influ- 
ence on the size of the correlations. Гог 
most purposes the good separation between 
treatments is the most important considera- 
tion. The size of the correlations for suc- 
cessive load days suggests that with larger 
numbers of Ss, individual differences could 
be successfully studied. 


Urine Volume and Concentration 


The day-to-day changes in four-hour 
urine output are shown in Figure 9. These 
changes are of the same sort as occurred 
with water intake, tending to increase inde- 
pendently of a particular treatment. There 
is, in fact, a good correlation between intake 
and urine volume on each load day. The 
median correlations for the six load days 
were .74, .70, .25 and .64 for the 0, 2, 4, 
and 6% loads, respectively. 
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Fic. 9. Four-hour urine volume for all groups 
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Fic. 10. Chloride concentration during first and 
second halves of the four-hour drinking period on 
successive load days. 


In Figure 10 the concentrations of uri- 
nary chlorides are shown for each of the 
load days. Whereas chloride concentration 
tends to decline from day to day, total 
chloride excreted remains relatively con- 
stant or increases slightly. Median chloride 
excretion in milligrams for the first and last 
three load days were, for the four groups: 
2.3 and 4.0, 14.2 and 13.0, 43.5 and 65.3, 
and 112.0 and 121.8, respectively. The in- 
crease in urine volume offset the decline in 
concentration of urine, and the net efficiency 
of chloride excretion remained as great at 
the end as at the beginning of the experi- 
ment. The volume increase in urine paral- 
lels the increase in water intake on succes- 
sive load days, but the direction of causal 
relationship is not determinable from the 
present data. It is possible that the repeated 
dehydration stress of the salt loads, super- 
imposed on a chronic negative water balance 
maintained by the 23.5-hour deprivation 
schedule progressively reduces the efficiency 
of the hormonal regulation of volume ех- 
cretion, perhaps through exhaustion of anti- 
diuretic hormone. This can only be decided, 
however, by future experimentation. = 

Excretion of sodium and potassium 
through load days was similar to that of 
chloride. Potassium loss increased as а 
function of load concentration, presumably 
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an obligatory output stimulated by the 
sodium of the load (Gamble, 1958, рр. 126- 
128), but not nearly to the same extent of 
the sodium or chloride increase. There were 
no chemical or behavioral indications that 
potassium excretion was at a dangerously 
high level under any of the treatment con- 
ditions. Reliability correlations for urine 
volume, ion concentration, and for total ion 
excreted were consistently positive, but 
Showed greater variability than did those 
for water intake. 


Weight and Drinking on Interload Days 


Since weight and drinking changes were 
shown in some detail in Experiments 1 and 
2, it is sufficient to report that the present 
experiment confirmed the earlier findings. 
These changes are a function of load con- 
centration and are quite stable from one 
load day to another. 


Daily water intake returns to normal 
within two or three days after loading, but 
weight does not necessarily do so. Figure 11 
shows the mean weights of the four groups 
during adaptation, on the six successive load 
days, and on the final day of the experi- 
ment. Although weights did not return to 
their previous level after loading, there were 
nO permanent differential effects of the dif- 
ferent loads. This is in spite of the fact 
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Fic. 11. Mean weights of load groups on alter- 
nate adaptation days, each load day, and the final 
day of the experiment (F), (The interload weight 
changes—weight increments as a function of load 
concentration—are not shown.) 


that the immediate changes varied from a 2- 
or 3-gram decrement on the day alo 
tap water loads to a 20-gram eir 
following 696 loads. Since the slope of the 
weight increment curve over the first E 
load days is the same as the slope over the 
last few adaptation days, prior to any treat- 
ment, it appears that the weight increases 
are due to normal growth, and are inde- 
pendent of the influence of our load vari- 
ables. 


The regular 30-minute intake on days 
prior to each load day was essentially uns 
changed throughout the course of the ex- 
periment, a further indication of good Tod 
covery from the loads. Considering all four 
groups separately on each of the days pre 
to a loading, and on the final day of pis 
experiment, the mean intake of the groups 
ranged from 18.4 cc. to 21.9 cc., these шеп 
being distributed around a grand mean 0 
20.4. The mean intake of all animals on the 
last day of adaptation was 20.06 and on the 
final day of the experiment it was 20.95 SS 
The animals obviously were not drinking 
appreciably more or less on the deprivation 
schedule at the end of the experiment than 
they were at the end of 10 days’ adapta- 


tion, and intergroup variability was prac- 
tically nil, 


EXPERIMENT 4: DELAY or Access TO 
WwW 


ATER AFTER STOMACH LOADING? 


One of the important variables in the 
study of salt loads would appear to be the 
delay between loading and the Ss’ access to 
water. While Wayner and Reimanis (1958) 


found that following hypertonic subcutane- 


ous NaCl injections drinking was inde- 
pendent of delays up to four hours, this 
may or may not be true for water consump- 
tion after stomach loads, Wayner reported 
no data for urine, but we have shown that 
both urine volume and urine concentration 
are functions of load concentration. To the 
degree that an animal can excrete salt БУ 
concentrating its urine before access {0 
water it might need to ingest less water (45 
was the case with some of the satiated loa 
ee ee 
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animals in Experiment 2) ; and if stomach 
loading is used in behavioral experiments 
the consequences of delays between loading 
and drinking (or training and testing of 
performance) should be understood. 


Method 


The Ss were 120 rats, divided into 24 subgroups 
of five each and averaging 378 grams in weight. 
А random groups factorial design was used, vary- 
ing 2% body weight volumes of 0, 2, 4, and 6% 
NaCl with delays of 60, 90, 120, 150, 180, and 240 
min. between loading and drinking. Each S was 
given the same load and delay twice, with а six- 
day recovery period after each load. The animals 
were delayed in the metabolism cages following 
loading; urine was collected and analyzed from 
tlie delay period, as well as the first and. second 
halves of the 4-hr. drinking period. 


Results 


Water Intake 


Figure 12 shows the total four-hour in- 
take as а function of load concentration and 
delay. These data were taken from the first 
load day only, but the functions were essen- 
tially the same for the second load day (as 
was true for all of the variables), differing 
only in the overall magnitude of intake. An 
analysis of variance showed that load con- 
centration, delay, and load day variables all 
exerted significant effects on intake far be- 
yond the .001 level. The delay X concentra- 
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Fic. 12. Four-hour water intake on Load Day d 
as a function of delay of access to water following 
loading. (The lines are least squares fits to the 


points.) 


tion interaction effect was just significant 
at the .01 level. 

The amount of water ingested in four 
hours showed a somewhat different relation 
between the load treatments than did either 
the 30-minute or 60-minute intake. Whereas 
the decline in consumption as a function of 
delay shows slightly after a half-hour of 
drinking and markedly after an hour, even 
at the 60-minute delay, the 4% and 6% 
groups drank the same amount in the first 
30 minutes (cf. Figure 7). At neither 30 
nor 60 minutes aíter access to water was 
there any separation between the intakes of 
the 2% and 49% NaCl groups delayed for 
four hours. 


Weight and Drinking Changes 
on Interload Days 


The changes in weight on the day follow- 
ing loading also declined as a function of 
delay, as did the 30-minute water intake on 
the day after loading. This, as in the previ- 
ous experiments, seems to be a function of 
total intake on load days. The animals re- 
cover their weights and return to regular 
drinking well within the one-week recovery 
period, as was found in Experiment 3. 


Urine Output 


Figure 13 shows urine output during the 
delay intervals and for the total period of 
recording (delay plus drinking period). 
Urine output during the delay period pro- 
ceeded at a constant rate whose slope was 
determined by load concentration, but total 
urine volume was very little affected by the 
length of the delay interval. The animals 
excreted about the same volume in five 
hours as in eight hours. To determine more 
specifically the effects of access to water on 
urination, the volume of excretion for the 
various subgroups was examined at 240 
minutes after loading. At this time one 
group had not yet been given access to 
water and the other groups had been drink- 
ing for various times up to three hours. 
There were no differences among the six 
delay groups at each concentration, indi- 


cating that urine flow follows a course that 
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Fic. 13. Mean urine out: 
vals on Load Day 1 (upp 
ume (delay plus four-hour 


put during delay inter- 
er) and total urine vol- 
drinking period, lower). 


is little, if any, influenced by whether the 
animal is drinking or not, 


as well as of load 
in Figure 14, whic 
cretion. The abs 
and potassium exi 
of chloride, but t 
the same form, 
during the delay period (about the same for 
all the NaCl load concentrations) and 
progressively declined during the first and 
second halves of the drinking period. 


Discussion 


The effects of delay on water intake, etc. 
are directly attributable to the fact that the 
Ss excrete a more concentra 
hence a greater proportion of 
if they do not have access to y 
ing dilutes the fluid load so th 


ted urine and 
the ionic load 
zater. Drink- 
at it is more 


readily absorbed. Being diluted, the concen- 
tration differential between load and body 
fluids is lower and does not require as 
drastic an excretory correction. ия 

These results become more meaningful 
when compared with those from Experi- 
ments 1 and 2. The total intake of the Ss 
in those two experiments was the same, and 
urine volume was the same, but over the 
range of concentrations used in the present 
experiment the satiated Ss put out a more 
concentrated urine in the last two hours 
than did the deprived. Since the deprived 
Ss immediately drank a great deal more 
water after loading than did the satiated, 
the explanation of the urine concentration 
difference (whatever the exact a 
may be) would seem to be the same as И 
the present experiment. 

The data obtained in this experiment 40 
not support the results of Wayner an 
Reimanis (1958), again probably due to 
differences in excretory control of the 12" 
posed loads. Since hypertonic solutions in^ 
duce movement of body water toward them, 
in Wayner’s experiment the water was 
initially moving toward the interstitial space 
under the skin of the back, whereas in ou! 
experiment the water went into the stomach 
and intestine. In this latter case, the absorp“ 
tive surface is much greater, and corrective 
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filtration by the kidney would be more 
rapid. 

In spite of the various changes that occur 
as a function of load-drink delay, the effects 
of concentration on drinking are maintained 
at all delays, with the exceptions of the 266 
and 4% loads at the four-hour delay. With 
short delays there is no difference between 
the 4% and 6% loads in the first 30 to 60 
minutes of drinking, but this is a temporary 
inhibition, not a differential management of 
the loads. 


EXPERIMENT 5: A FURTHER STUDY OF 
GASTROINTESTINAL EXCHANGE RATES OF 
WATER AND SODIUM CHLORIDE 
IN Tuimsry Rars? 


In an earlier experiment (О’КеПу, et al. 
1958) the gastrointestinal exchange rates of 
3% body weight volumes of water and of 
0.5, 0.87, and 3.0% NaCl were determined. 
The rates were studied up to a period of 
35 minutes after loading. These rates were 
related to bar pressing following loads 
(O'Kelly & Falk, 1958) and to runway be- 
havior (Solarz, 1958). Other experimenters 
have indicated the importance of post- 
ingestive factors in regulating drinking in- 
hibition and other aspects of motivated 
behavior (Miller & Kesson, 1954; Miller, 
et al., 1957; Towbin, 1949). If we wish to 
control motivation by the “artificial” means 
presented in the previous experiments it is 
of some importance to know the rates at 
Which the loads move through stomach and 
intestine and the amounts and distributions 
of the fluids accumulated in the gut by the 
osmotic action of the hypertonic loads. Ac- 
cordingly, the purpose of this experiment 
was determination of gastrointestinal ex- 
change rates for the 29 body weight vol- 
umes of water, 2.0, 4.0, and 6.0% NaCl 


used in the previous experiments. 


Method 


Subjects and procedure. The Ss were 285 
male albino rats of the Holtzman strain, varying 


in age from about 120 to 200 days. All of them 


5 p. T, O'Kelly and К. С. Beck. 


had been used in earlier experiments of various 
kinds, but were readapted to a 23.5-hr. depriva- 
tion schedule ior at least 10 days before loading 
and gut determinations were made. The data were 
gradually collected over the course of a year as 
the animals became available. Ч 

The procedures are described in detail else- 
where (O'Kelly, et al, 1958). Animals are 
stomach loaded with measured volumes of fluid. 
After varying delays they are sacrificed, and the 
stomach, intestine, and cecum are clamped off 
and separately removed. The fluid content of each 
component is determined by obtaining its weight 
before and after oven drying to constant weight, 
with a correction made for inherent tissue water. 
From the measured volumes of fluid recovered 
from stomach, intestine, and cecum after the 
various delays, the rate of movement of fluid 
through these compartments may be determined. 

Experimental design. The experiment called 
for 24 groups of 10 Ss each, given one of the 
four load concentrations at each of six delay 
periods: 15, 30, 60, 120, 180, and 240 min. Be- 
cause the intestinal absorption curve for 6% NaCl 
had not reached an asymptotic level at 240 min., 
5 animals with water loads and 10 animals with 
each of the NaCl loads were sacrificed after 300- 
min. delay. Additional 6% NaCl animals were 
run at 360 and 420 min. six at the former and 
four at the latter delay. 


Results and Discussion 


The results are summarized graphically, 
each point being the mean of all animals 
sacrificed under the particular delay interval 
indicated. Figure 15 shows the fluid recov- 
ered from the stomach after the various 
delays. Rate of fluid clearance is an orderly 
function of concentration. Water and 2% 
NaCl clear exponentially. After an initial 
period of stabilization at 70-90% of load 
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Fic. 15. Percent load volume remaining in 
stomach at various delays after loading. 
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Fic. 16. Percent load volum 


e remaining in the 
small intestine. 


volume, the 4% and 6% NaCl solutions 
Seem to clear in a similar fashion. 

Figure 16 illustrates the time course of 
filling and emptying of the small intestine. 
Water manifests a small net gain of stom- 
ach clearance over absorption in the first 
30 minutes after loading, but transport to 
cecum and absorption have emptied the 
small intestine of the water-loaded animals 
by 60 minutes, Accumulation of fluids from 
the hypertonic loads is, by 30 minutes, 
directly Proportional to the NaCl concen- 
tration of the load, as is subsequent absorp- 
tion and cecal flow, Intestinal distention by 
the hypertonic loads is maximal at 30 min- 
utes and declines rapidly thereafter, 

Accumulation and dissipation of fluid in 
the cecum is shown in Figure 17. Maxi- 
mum fluid тесоуегу occurred at 60 minutes 
post-loading, and cecal emptying is inversely 
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Percent load volume Temaining in the 
cecum, 


proportional to load NaCl deme 
Since no diarrhea was observed during Si: 
delay periods, we assume that the MA | 
portion of the fluid remaining in the ce | 
is subsequently absorbed. Mo 
Figure 18 shows the percentage of г 
which is absorbed from the intestine es a 
several delays. For all the hypertonic оа : 
there is an initial period of negative ие 
tion for some time after loading—i.e., me | 
amount of fluid in the intestine is epee ү 
than that given in the load. The pc к. 
negative absorption is а direct fn MM 
load concentration, but the inflection p ad 
for all loads is at 30 minutes. The 6% M 
does not return to its initial volume 9: " | 
about three hours after loading, and € x 
sorption is not achieved until seven ho 
after loading. M 
These curves suggest that in behav 
experiments it would be best to wait at ae 
an hour, and preferably two hours mes 
loading with hypertonie NaCl solutions oe 
fore requiring the Ss to perform. О! ‘i 
wise, the fluid content of the intestine а, 
be rapidly increasing, with whatever ke 
this might have on performance—pro "e 
inhibiting. This problem will be discu 
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agam after we have presented data f 
Experiment 6, 
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EXPERIMENT 6: ARTIFICIALLY CONTROLLED 
THIRST AND WATER REINFORCED 
Runway BEHAVIOR? 


O'Kelly and Falk (1958) showed that 
when thirsty rats were loaded with water 
that water-reinforced bar pressing on a con- 
tinuous reinforcement schedule was de- 
Pressed in direct proportion to the load 
volume, and that it increased with various 
low concentration NaCl loads. Solarz 
(1958) found that running slows down as 
а water load has progressively more time to 
be absorbed before running is permitted. 

he present experiment was concerned with 
the problem: as а function of load concen- 
tration and runway length, how many highly 
massed, water-reinforced trials will a rat 
run in a single session? This question con- 
cerns both the effects of hydration and 
effort on performance and the methodology 
of using the loading technique in behavioral 
experiments, 


Method 


Subjecis. The Ss were 52 male albino rats of 
the Holtzman strain, 120-160 days of age at the 
beginning of the experiment. They had previously 
een used in other experiments, but had no prior 
experience with either runways or water depriva- 
tion, 


Apparatus. A 2-ft. enclosed alleyway, to which 
а 4-ft. extension could be attached, was used. The 
delay between raising the start-box door and the 
S breaking a photocell beam 6 in. in front of the 
door (starting time) and the time between cross- 
ing the first beam and a second one 6 in. in front 
of the goal-box entrance (running time) were 
automatically recorded. When a water reinforce- 
ment of 40 licks was taken from the drinking 
tube at the back of the goal box, the tube was 
automatically withdrawn, The amount of water 
per lick was estimated from preliminary tests to 


be about .005 сс., or about 0.2 сс. per reinforce- 
ment, 


Experimental design. Adaptation and runway 
training: The animals were adapted to a 23.5-hr. 
deprivation schedule and had two trials daily in 
{һе runway for a period of 10 days prior to test- 
ing. Each day they were weighed, run, watered 
for 30 min, then returned to the home cages 
where they lived in squads of four. 


Test day: On Day 11 each member of a 
squad of four was loaded and after a 25-min. 


IU. cus 
TR. C. Beck and L. I. O'Kelly. 


delay was run to its regular water reinforcement 
until it reached a criterion of three consecutive 
60-sec. latencies in leaving the start box. If 5 
did not leave the start box within 60 sec. or spent 
60 sec. in the alley, he was removed and put into 
the goal box by E to determine if he would drink. 
Whenever 5 was in the goal box he was removed 
either just as soon as the tube was pulled out or 
aíter 60 sec. if he did not drink. Following re- 
moval from the goal box S was returned directly 
to the start box and the next trial was begun in 
about 15 sec, just enough time to allow E to 
record the times ior the previous trial and to 
reset the clocks. When S reached criterion he 
was removed to the regular drinking box and 
allowed 60-70 min. of drinking. 


For the next four days after testing the Ss 
were maintained on the same 23.5-hr. deprivation 
schedule as during adaptation. On the fifth day 
they were loaded with the same solution as on the 
test day, and after а 30-min. delay were sacrificed. 
Their guts were processed as described in Ex- 
periment 5 (they provided the data for the 30- 
min. delay point in that experiment). 


Results 


Test Рау Performance 


Trials to criterion. Table 2 gives the 
number of trials to criterion for each of the 
Ss. These are the number of times each 5 
left the start box, whether or not he drank. 
An analysis of variance showed both the 
overall concentration variable and runway 
length to be significant beyond the .01 level 
of confidence. Individual comparisons of 
subgroups showed the significance of these 
functions to be due mainly to the relatively 
low scores of the Н.О and 696 groups, as 
opposed to the equally high scores of the 
2% and 4% groups. 

Water intake. On almost all trials when 
an animal drank in the goal box the full 
reinforcement was consumed, making it pos- 
sible to plot the relationship between the 
amount of water received at the time the 5 
reached criterion (load plus goal-box in- 
take) and total intake (the hour's drink 
after criterion added). This is shown in 
Figure 19, which also has the number of 
reinforcements plotted on the right-hand 
ordinate. Whereas the function for goal- 
box intake is parabolic in form, total intake 
is a monotonic function with no significant 
differences between the two runway groups. 
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TABLE 2 


SCORES ғов TOTAL TRIALS Кох To CRITERION 


Runway | Load Concentration (% NaCl) 
Length | 0 2 4 6 
| 
2feet | 7 40 55 5 
| 30 4 70 18 
30 55 80 25 
30 67 86 39 
| 31 79 90 141 
47 81 135 
94 
N 6 1 6 5 
М 29.2 65.4 86.0 45.6 
ма 30 67 83 25 
| SD 11.7 19.1 21.7 48.9 
6 feet 3 


N 7 7 7 7 
M 15.0 42.4 46.4 Rak 
Md 12 52 46 9 
SD 10.1 29.4 32.4 14.9 


entries initial], 
Were lost to illness, 


The goal-box drin 


6% 


combined had a tota 
drinking and the 6% 


8 and 18 for the 2% and 4%. The running 


very peculiar, 


Y pledge pushing 
а peanut down the sidewalk with his TUS 


Such animals, in either runway 
у 
to the goal-box entrance, then s 


seconds until put into the goal box, where 


they would not drink. These same animals, 
on the other hand, drank immediately when 
put into the drinking box. The only ap- 
parent reason for this failure to drink in 
the runway was that the tube was slightly 
recessed, whereas in the drinking box the 
tube extended about a quarter of an inch 
into the box, making it more easily acces- 
sible. 


The water-loaded animals had a slightly 
greater total intake on load day than on 
final adaptation day, 21.9 vs. 19.6 in the 
2-foot group, and 21.6 vs. 20.7 in the 6-foot 
group. Considering that the animals had 
more time to drink on test day and that the 
goal-box intake was estimated, these corre- 
spondences are so close that we can quite 
accurately describe the total intake on 10а 
day as the amount of water needed to Te 
store fluid balance. By inference we can 
describe the salt-loaded rats’ intake in ae 
same manner and by comparing intakes а 
the time of criterion with total intake we 
have the percentage of water needed whic" 
had actually been ingested. For the increas- 
ing concentrations of load these were: 
foot: 38, 65, 63, and 3196; and 6-foot: 5% 
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Fic. 19. Load day water intake and number of 
reinforcements, (The left ordinate is the volute 
of water Consumed, and the right ordinate 15 ae 
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ment intake is estimated. The upper curves 4 
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51, 40, and 22%. The Ss in the long runway 
were generally getting about 10% less of 
their total than were those in the short 
runway. 


Time Scores 


During training, the starting-time scores 
decreased steadily, but the animals in the 
2-foot runway consistently started much 
faster than those in the 6-foot runway. 
Running times were about the same for 
both groups, when measured in feet per 
second. On load day the only differential 
effects of treatment on running, other than 
those due to differences in attaining crite- 
rion, were the slower starting times of the 
Н.О and 696 groups as compared with the 
2% and 496 groups in the 2-foot runway 
(medians of the first five trials). On the 
day after loading, both starting and running 
times were significantly slower, but there 
was almost no differentiation among the 
variously treated subgroups and the animals 
in both runways increased about equally. 
Over the four post-load days these times 
gradually returned to their previous levels. 


Weight and Drinking Changes 


Аз found in the prior drinking experi- 
ments, there were increased weights and 
decreased drinking on the day after loading, 
but interestingly, the 6% groups had smaller 
changes than the 4% groups. This differ- 
ence was significant at the .02 level for 
weight changes. The reason for this is not 
clear, but in the repeated load study, on the 
day after the first load, this occurred also. 
After the remaining five load days in that 
study, however, the 6% group did show 
the greatest change so the effect is still 
unclear. 

Drinking was depressed following load 
day as a function of both concentration and 
runway length, rather surprising since both 
runway groups drank the same total amount 
on load day. This may, in fact, be no more 
than sampling error, though significant be- 
yond the .01 level for both variables. The 
2-foot group showed the least change. 


Gastrointestinal Data 


Since these data are the same as those in 
the clearance study, the reader may refer 
back to Figures 15-18. As indicated there, 
the amount of fluid in the stomach and in- 
testine increases with the concentration of 
the load, and even with the 2% NaCl load 
the total fluid content of the stomach and 
intestine at 30-minutes post-loading exceeds 
the volume of the load. The gut data were 
correlated with the various running рег- 
formances, but yielded no significant look- 
ing trends within the various subgroups. 
This lack of relationship may be for any of 
several reasons, such as the small number 
of Ss per subgroup, low variability of scores 
for GI contents, or lack of reliability from 
one loading to another. 


Discussion 


Since the total water intake increased on 
the test day proportionally with load con- 
centration, but trials to criterion did not, the 
loads are seen to have both inhibiting and 
facilitating effects. There are two aspects 
of the data to be accounted for: the differ- 
ential proportion of total intake between the 
2-foot and 6-foot groups at the time the 
criterion was reached, and the nonlinear 
relationship between trials to criterion and 
load concentration. 

Two mechanisms conceivably were opera- 
tive with respect to the difference between 
the two runway groups: differential condi- 
tioning and/or the differential effort in- 
volved in running the two alleys. Since the 
Ss in the shorter alley consistently were 
faster in starting to run during training 
than were those in the longer alley, there 
was presumably some kind of differential 
conditioning such that a smaller amount of 
thirst reduction might suppress responding 
in the group with weaker conditioning. In 
combination with this, the inhibition devel- 
oped from running the longer runway 
should tend to weaken the reaction potential 
for the running response more than would 
the reactive inhibition developing in the 
shorter alley. 


hà 
N 


The nonlinear results for trials to crite- 
rion may be explained by the inhibition 
from the load itself. The increasing amount 
of fluid in the gut following loading un- 
doubtedly made running uncomfortable in 
the 6% groups and perhaps somewhat in 
the 4% groups. The relatively high inci- 
dence of running without drinking in the 
two extreme groups tends to be consistent 
with Solarz’ report (1958) that, after water 
loading, running persisted longer than 
drinking. He discusses this finding in terms 
of differential excitability and suppression 
of habits by thirst “receptor” and “satiety” 
cues, drinking being more sensitive to 
changes in water deficit, increase or de- 
crease, than running. 

The present experiment controlled as 
much as possible the spacing of trials on 
test day and the delay between running and 
loading. These variables are interrelated 
insofar as either of them could allow dif- 
ferential changes in the transfer of fluid 
within the gastrointestinal tract 
might in turn affect running 
The importance of the Spacing of trials is 
indicated by comparin 
respectively, with water, 2% and 4% NaC] 
and run in the 2-foot Tunway with trials 
Separated by 90-120 seconds, Running the 


xtinguished after 


which 


Ee d a 2). A 6% animal ran only 
Tals, however, well within the ran 
. H d е 
Obtained in the experiment proper. h 


problem is further 


however, 
ойр; the 6% 

The starting scor, 
were indicative of i 
duced by our loading, but the hig’ 
trials precluded preci 
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probable that behavioral research using this 
technique of manipulating motivation should 
employ single trials, or some more optimal 
spacing. Since only one animal in the 
present experiment failed completely to run, 
there should be little attenuation from that 
source. 


GENERAL DISCUSSION 
The Linear Scaling of Thirst 


The results we have reported on facilita- 
tion of drinking by loads of hypertonic 
saline, when considered together with our 
findings of drinking inhibition by water 
loads varying in volume indicate the possi" 
bility of arranging load conditions in such 
à fashion as to produce a linear change т 
Water consumption over a very wide range. 
If we тау assume that drive strength т 
this situation is directly proportional to con- 
Summatory response, we see that stomac 
loading provides a technique for securing а 
linear scale of drive strength. It is of inter- 
est, therefore, to find a series of loads which 
will produce increments of the same slope: 
but opposite sign, as the H,O decrements: 
Such a series is shown in Figure 20. Tus 
NaCl points Were determined by obtaining 
а least squares fit to the H,O points, the? 
selecting values from the data of Exper 
ment 1 which showed a reasonable fit to the 
upward extrapolation of the water-load 1€ 
Stession. It turned out that increasing vol- 
umes of NaCl having about a 2% concen- 
tration potentiated drinking in 23.5-houT 
deprived rats to the same degree that the 
Same volume of Water depressed drinking- 
The 3, 4, and 5% volume points were OP 
i ; the 1 and 27% 
Were additional empirica 
The value of this particular exercise lies 
not so much in establishing the fact that 
drinking can be increased or decreased in 4 
a manner, since an infinite variety 
oads might be used to give the same inc" 
ments, but rather, that it can be done 50 
easily and with such precision. 

“Tom the various experiments we © 
also Summarize the results of loading W 
а variety of Concentrations of 2% bo 
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Vic. 20. Linear decrement and increment of 
drinking in thirsty rats. (The left half of the 
graph shows intakes at various times after different 
volumes of water loading—sce also Fig. 3—and the 
right half shows intake after the same volumes of 
NaCl load. The exact concentrations of NaCl are 
Shown with each point. The vertical dotted line 
goes through the zero-load points.) 


Weight volume of load, and compare this 
with the intake of water as a function of 
hours of deprivation. This comparison is 
Shown in Figure 21. There is essentially a 
linear increase in drinking with load con- 
centration up to 5% and then a sharp de- 
cline at 696, on the 30-minute intake. The 
deprivation curve, on the other hand, is 
negatively accelerated ; 47.5 hours of depri- 
vation leads to the same 30-minute intake 
as does a load concentration of about 3% 
NaCl. An extrapolation from the depriva- 
tion intake curve to match the highest 30- 
minute intake after loading would require 
a number of hours of deprivation far be- 
yond the limits of the rat; the four-hour in- 
takes are totally beyond comprehension in 
terms of production by deprivation. 

We may also refer back to Figure 20, and 
compare it with the deprivation curve in 


Figure 21. The highest NaCl load volume 
in Figure 20 (5%) produces a 30-minute 
intake which is just about the same as that 
produced by 47.5 hours of deprivation, but 
the former has the advantage that equal 
increments of volume produce equal incre- 
ments of drinking, rather than progressively 
smaller increments. In addition, whereas 
the health of the 47.5-hour deprivation ani- 
mals steadily declines over 20 days (as indi- 
cated by progressive weight loss), the 
loaded animals remain healthy and appear 
to thrive on the schedule (as shown, for 
example, in the weights and daily water 
intakes over the course of repeated load- 
ings, as shown in Figure 11). 


The Effect of Ether 


One problem not considered in the ex- 
periments proper is the effect of ether per 
se on water intake, it having previously 
been shown to depress drinking in a 30- 
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Fic. 21. Comparisons of water intake after уагу- 
ing hours of deprivation (the tenth deprivation 
interval on each schedule) with intakes following 
loading with various concentrations of 296 body 
weight NaCl. (The data for the 11.5-, 35.5-, and 
47.5-hr. deprivation points were taken from an ex- 
periment run in our laboratory by Jack Dluhy on 
animals identical in age, sex, and strain to those in 
the present study.) 
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minute period (O'Kelly & Weiss, 1955). To 
estimate the results of etherization in Ex- 
periment 1, two additional groups of ani- 
mals (with Ns of 6 and 9, respectively) 
were tested for four-hour intake : one group 
had 23.5-hour deprivation, the other with 
deprivation followed by etherization before 
access to water. These are shown as the 
"0" load volumes in Figure 20. The ether- 
ized group had about 3 cc. less intake in 30 
minutes than the nonetherized group, an in- 
take falling directly on the regression line 
for tap water loads. The nonetherized 
group, on the other hand, fell above the 
regression. After four hours, however, these 
groups had the same intake, indicating that 
the ether effect had worn ой. This is in 
agreement with the typica] findings on 
humans, that ether is dissipated from the 
System almost completely in four hours 
( Goodman & Gilman, 1955, p. 63). 


Since the 
load groups and the etherized 


group all 


maintain their same relative positions 
throughout the four hours of drinking, and 
the no-load etherized 


group gradually 
г group, it would 
as an equally depressing 
rather than a differential 
any particular load. 


catches up to the nonethe 
appear that ether h 
effect on all loads, 
effect on 


Load Inhibition 


| Я appear to produce gastro- 
intestinal distentions inhibiting to perform- 
ance, however, as witnessed by the fact that 
in the runway study the 2% and 4% 
mals did run а great deal $ 

water-loaded 55, There 


some inhibition with the 4% load however 
since the animals under that condition ran 
no more than those with the 2% 


7% load, even 
though they drank more. The 6% animals 


may have been 


were clearly inhibited. The most conserva- 
tive procedure would be to delay the various 
load groups until the highest concentration 
group had cleared the load to the same de- 
gree as the 266 group in 30 minutes, the 
delay period used in the runway study (Ex- 
periment 6). It is difficult to estimate this 
exactly since we do not know just where 3n 
the gut the inhibition occurs. If we look 
at the intestine (ligure 16) we estimate 
about 90 minutes; if we look at stomach 
clearance (ligure 15) we reach an estimate 
of about two hours for 496 loads and poni 
hours for 6% loads (at which time, according 
to the delay study, there is little difference be- 
tween 2% and 4% load groups). Only 
further research will tell us exactly hour 
the best delay is, but all things considere: 
we would presently estimate about wohan 
With this period there is no inhibition Pes 
drinking in the 655 load groups, and absorp 
tion from intestine is pretty far along 
Further experiments are in progress which 
should tell us more exactly what delays at 
best. i 


SUMMARY AND CONCLUSIONS 


^ ing 
From all the experiments, the follow! 
results stand out: 


1. With loading, one can control er 
water ingestion drinking of thirsty rats uon 
à range from complete inhibition to an ! 


©з о 2 : riva 
take maximum unobtainable with depr!V? 
tion. 


the 


2. The volume of NaCl 
Stomach load plays a relatively small role а 
comparison with the amount of 500117 
chloride in the load. For a given amount 0 
NaCl, the effect of increasing the volume 
of water js to reduce the intake to the de- 


Sree that water has been given to the anim? 
їп the load itself, 


3. Within the range of values studied im 
tensively (0-6%, with 2% body weight х0, 
ume), there was no harmful effect of t” 
loads over half a dozen repeated loads ? 
with long delays between loading and acce 
to water. There is по reason to believe п. 
this should not hold for all water loads “ 


а hypertonic 


р adl 
have used (1-5% body weight) or fot в 
values of NaC] content which fall in 


= 


WATER REGULATION IN THE RAT 2 


range (1.е., absolute NaCl contents from 0.3 
to 1.2 grams per kilogram of body weight, 
in solutions between 1-5% body weight). 

4. The technique produces highly reliable 
results over repeated treatments. 

5. With repeated loads, water intake 
tends to increase in a four-hour drinking 
period but this is offset by an increased 
urine output and decreased urine concen- 
tration such that the load is equilibrated to 
about the same extent over repeated loads. 
'The increase in intake over repeated loads 
is the same for all loads used. 

6. Following loading, weight increases 
and drinking decreases as a function of 
load concentration (and hence, of total 
intake on load day). A four-day recovery 
period is probably sufficient, but a six-day 
period is surely sufficient and is consider- 
ably more practical in terms of experimenter 
convenience. 


Л 


7. As a function of delay of access to 
water following loading: (a) drinking in 
four hours decreases, (b) post-load weight 
and drinking changes are attenuated, (c) 
urinary excretion of ions increases, although 
total urine volume is about the same for all 
delay groups. The ability of the rat to 
manage salt loads by renal clearance is an 
important consideration and should be taken 
into account in any experiment with loads. 


8. With the various hypertonic loads, 
water is drawn into the stomach and in- 
testine for the first half-hour after loading. 
The time for clearance and absorption of 
the load is then an orderly function of load 
concentration. After loading with high 
concentration NaCl, drinking and perform- 
ance are initially inhibited; a delay of one 
to two hours is suggested when stomach 
loading of hypertonic NaCl is used in 
behavioral experiments. 
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T= purpose of this study was to in- 
vestigate a method for obtaining an 
objective, quantified, predictive measure of 
educability for children who are severely 
à retarded mentally. The basic hypothesis 
|= Was that the child's educability сап be in- 
| ferred from his responses to learning situa- 
tions which require abilities on which edu- 
cation depends, namely, to learn from 
instruction, to transfer the training, and to 
retain the learning. 
.. With increasing attention focused on the 
. education of these children, the need for 
suitable methods of evaluating their abilities 
has become acute. Their limited abilities 
must be assessed, and attention directed to- 
ward developing the most profitable ones, 
to whatever extent this is possible. 
Unfortunately, it is precisely these chil- 
dren whose abilities are among the most 
difficult to evaluate by the usual testing 
H procedures. On standard tests, the child 
may be heavily penalized by verbal, motor, 
о Sensory, and experiential handicaps. Fur- 
ther, inability to cope with standard tests is 
. . Опе of the outstanding behavioral character- 
istics of the mentally retarded. Tests with 
Which a child cannot cope will hardly serve 


|, 
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to identify his assets. No curriculum can 
be built on what a child cannot do. 

It has been said that “the ideal test in the 
study of mental deficiency would be one 
which investigated the ability to learn” 
(Penrose, 1934, р. 49). This research was 
undertaken to study learning in severely re- 
tarded children, and to use the results as 
the basis for determining what they can do. 


METHOD 


The general method of testing was as fol- 
lows. A battery of tests was administered 
to the subjects individually. Each subject 
was first given the tests without training. 
He was then trained to the correct re- 
sponses according to standard training pro- 
cedures, and retested to permit measure- 
ment of posttraining learning gains. Differ- 
ent forms of the tests were used to evaluate 
transfer. Retention scores were obtained by 
retesting the subject at various later dates. 


Preliminary Experimentation 


The final test battery was determined on the 
basis of the results of a series of pilot studies. 
"The chief aim of the latter was to bring the tests 


Reissenweber of the Institute for the Crippled 
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ociation for the Help of Retarded Children, 
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Bridgeport, Connecticut, the Shield of David In- 
stitute for Retarded Children in New York, and 
the Westchester School for Retarded Children. 
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into increasing accord with criteria which were 
established in advance of the preliminary experi- 
mentation. The criteria were as follows: 

1. Tests should be appropriate to the level of 
ability of the subjects. 

2. Tests should arouse and maintain the inter- 
est of the subjects. 


3. Tests should be suitable for brain-injured 
subjects, 


4. Tests should depent primarily on abilities 
sufficiently well developed at the age levels of the 
subjects, 

5. Tests should depend minimally on past ex- 
perience, 

Before the 
following attempts to meet the 
in the construction of the preliminary battery. 

i i i , tests were 


subjects 
abilities, these areas were thought to be relevant 
n the special Purpose of the tests Imitative 
ea 


‹ 1955, 
hand, Simple size, shape, 


training 
training 


е planned, to hold 


eet the criter 
subjects 


Materials were three-dimensional and relatively 
large. Surfaces were dull to minimize the reflec- 
tion of light. Tests were not timed. The exam- 
iner manipulated the test materials at the sub- 
ject’s direction, if he could not do so himself. 
Only minimal responses, such as simple choices 
among the test materials, were required. Mate- 
rials were presented íar enough apart so that a 
gross motion of hand or arm sufficed to indicate 
the subjects selections. An achromatic scale was 
used, because of possible difficulties in color E, 
ception. Materials were painted black, white, an) 
Various grays in the Munsell Gray Scale, m 
Were presented on a cloth of approximately th 
middle gray in the scale. | 

Developmental norms from the literature 
served as guides to aid in meeting the fou 
criterion, and abilities usually developed at е 
опе year below the age of the youngest subia 
tested were selected. Two methods уеге es 
in the attempt to meet the requirement of nie 
dependence on Past experience, First, tests W ry 
Sought which provided the experience neces 
for Performance in the testing and training be 
cedures themselves, Otherwise, tests were y 
Sidered if it was reasonable to assume that ri- 
materials and Procedures were within the exP© 
ence of all of the subjects, M 


Е i in the 
, Transfer Situations were constructed 8. Ads 
light of E, L, Thorndike's theory, which ^ the 


that transfer depends on identical elements m 
original learning and the new learning whic? 
facilitates (Hilgard, 
forms were 
studies by 


it 


transfer forms would remain within the abilitte® 
of the Subjects, only relatively few and 
the tively simple changes were made. Som 
the principles followed in the construction of 


and Fischer (1957). 


.peri- 

The sample used in the preliminary a^ 
mentation consisted of 115 mentally retar о 
children, Of these, 80 were between the n A 
f the 


group. The remain 
experimentation wit 
and Procedures, 

In addition, Sight severely retarded subit 
between the ages of 3-6 and 4-1, and six chil er- 
0 were tested, to aid in де, 


N 
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to their medical and psychiatric records, were 
also tested on an experimental basis. 

The original battery consisted of 11 tests. 
Preliminary work with some of these tests is 
reported by Schucman (1957-58). At least two 
tests in each area were included in the preliminary 
battery, since it was expected that some of them 
would prove to be unsatisfactory. 

All of the tests were administered in the first 
four pilot studies. The less suitable ones began 
to be dropped thereafter. In the earlier pilot 
Studies, only pretraining, posttraining, and transfer 
Scores were obtained. The later ones also in- 
cluded retesting the subjects at various later dates, 
to aid in determining suitable time intervals at 
which to measure retention. 

Several transfer forms for each test, based on 
changes in the original materials, in the proce- 
dures, or in both, were tried in the preliminary 
experimentation. The following procedures were 
used in introducing change into the materials. 
For tests of shape discrimination, brightness or 
size was altered, and shape was held constant. 
In size discrimination tests, shape or brightness 
was changed, and size kept constant, In test of 
brightness discrimination, brightness was held con- 
stant, and shape or size was changed. 

Various alterations in the original procedures 
for administering the tests were also studied in 
the preliminary experimentation with the transfer 
forms. In one method, the original materials 
were presented, but in a different order. In a 
second method, the original demonstrations to 
the subject were included, but without the expla- 
natory gestures which had accompanied them. In 
а third, the demonstrations were omitted, and 
only the explanatory gestures were given. In a 
fourth, the test was administered according to 
all of the original procedures, but in a different 
Place. 

Materials, procedures, and the order in which 
the tests were to be administered were determined 
її advance of each pilot study, and a list of 
Criticisms was drawn up afterwards. Tests were 
changed, substituted, accepted, or rejected on the 
basis of their adequacy in meeting the test criteria, 
Quantitative and qualitative analyses of the data 
were made after each pilot study, to indicate 
Strengths and weaknesses in the tests. The pre- 
liminary battery, and its evolution in the course 
of the pilot Studies, are described in Appendix B. 


Selection of Subjects for the Final Battery 


The sample to which the final test battery 
was administered was limited to subjects 
suffering from various pathological physical 
conditions known to produce mental re- 
tardation, as stated in their medical records. 
Most of these records were obtained from 
special clinics for the mentally retarded. 


Some were supplied by physicians on the 
staff of the school which the child attended. 
In a few cases, the report of the child's 
personal physician was used. 

Medical diagnoses ranged from relatively 
clear-cut statements of diagnostic category 
(such as "mongoloid") to more general, de- 
scriptive statements (such as  "brain- 
injured"). Subjects whose retardation was 
ascribed primarily to “emotional difficulties” 
or schizophrenia, and categories such as 
"familial" “undifferentiated,” “unclassi- 
fied,” and “etiology undetermined” were ex- 
cluded from the sample. 

In limiting the sample to severely re- 
tarded subjects on the basis of psychological 
test data, only evaluations made within the 
year, by qualified psychologists attached to 
the clinic or school which the child at- 
tended, were used. An IQ below 50 on the 
Revised Stanford-Binet Intelligence Scale 
was used as the criterion for all of the sub- 
jects to whom the scale could be meaning- 
fully administered. Sixty children fell into 
this category. For subjects to whom the 
Binet scale could not be given, an IQ below 
50 on any other standard intelligence test, 
such as the Merrill-Palmer Scale, the Cat- 
tell Intelligence Scale for Infants and 
Young Children, and the Kuhlmann-Binet 
Scale, was accepted. This criterion was used 
for an additional 41 subjects. 

An estimated IQ below 50 was used for 
the 13 subjects who could not, or did not, 
respond to any of the standard intelligence 
tests in a way that permitted meaningful 
scoring. An estimated IO was a measure 
made by the staff psychologist of the school 
or clinic, who had evaluated the child at 
least once during the past year, and who 
was personally familiar with him. The psy- 
chologist based the estimate on a study of 
the child's psychological record, which in- 
cluded the results of whatever standard test 
could be administered to the particular 
child, analyses and interpretations of the 
scores obtained, observations of the child's 
behavior, and clinical impressions of his 
level of mental functioning. The Vineland 
Social Maturity Scale, administered to the 
parent, was the only standard test that 
could be used for all of the subjects, 
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The age range for the final sample was 
set at 5-1 to 11-11. Results obtained in the 
pilot studies suggested that above that age 
level the tests tended to become too easy to 
permit discrimination among the subjects, 
and below it, the tests tended to be too 
difficult. 


Screening Procedures for Sight and Hearing. 
The subjects’ vision and hearing were evaluated 
by the following procedures, Medical records, 
teachers’ reports, psychological evaluations, and 
questionnaires filled in by the parents were studied. 
Trained personnel familiar with the child were 
consulted. If these sources gave no indications 
that the child's sight and hearing were too seri- 
ously impaired, the examiner undertook special 
Screening procedures for the specific purposes of 
the tests. 

Before the child w; 
a number of the s 
placed on the table, 
area to be used in the 
up these materials, 
glanced from one to 
that his sight was ad 
The child’s responses 
examiner from various 
Were also noted. Mater: 
introduced at several р 
vision. If he turned, г 
directions, it was ass 
adequate for the pur 


Sample. The s 


as brought into the room, 
maller test materials were 
covering approximately the 
testing. If the child picked 
reached toward them, or 
another, it was assumed 
equate for test purposes. 
to remarks made by the 
positions behind the child 
ials involving sound were 
joints outside his range of 
eached, or glanced in these 
umed that his hearing was 
Poses of the tests. 


tion of 12.01. The; 
0 to 36 months, m 
10.84. Their ages ranged from 61 to 143 months, 
mean 95.16, deviation 23.48, Thirty- 
seven were mongoloid, and 11 were cerebral- 
palsied. The remainder suffered from various 
pathological conditions known to Produce mental 
retardation, such as microcephaly, hydrocephaly, 
cretinism, and congenital cerebral hypogenesis. 
There were 69 boys and 45 girls, 


Final Battery 
The five tests which comprised the final battery 
are briefly described below. More detailed de- 


scriptions of materials and testing and training 
procedures are given in Appendix A. 


? Estimated IQs up to 55 were used, 


1 provided 
the subject's score on the Stanford-Binet scale 
was below 50. Estimated IQs tended to be sev- 


eral points higher than Stanford-Binet IQs. 


I. Two-Buttons Test was primarily a test of 
imitative ability and memory. Materials con- 
sisted of a number of buttons, and two trans- 
parent boxes into which the buttons were dropped 
in alternating order. Any gross motion in the 
direction of the correct box or boxes constituted 
a correct response on the part of the subject. 
The test included six items, the buttons being 
presented to the subject in increasing numbers. 
Before each item, the examiner demonstrated the 
correct response. The subject was then requested 
to imitate the demonstration. In the transfer 
situation, the demonstrations were omitted, and 
only the original explanatory gestures were made. 

This test was found to be a good beginning 
for the battery. A few preliminary clicks of the 
buttons in the boxes usually aroused immediate 
interest. Giving the child the buttons to play with, 
in advance of the test, was an excellent way of 
establishing rapport. 


II. Buzz Test was chiefly a test of size discrimi- 
nation. It consisted of a number of round, black 
discs of various sizes, attached to a gray board. 
Separate black discs of comparable sizes were pre 
sented to the subject, who was shown, in pretest 
demonstrations, that placing a disc over the corre- 
sponding sample on the board produced a continu- 
ous, buzzing sound. The subject was required to 
indicate the correct placement on the board for cach 
of the discs presented to him. "There were siX 
items, arranged in increasing order of difficulty. 
In the transfer form, white discs, corresponding 
in sizes to the black ones, were presented to the 
subject, to be matched to the black samples on 
the board. 


This proved to be a satisfactory second test 
in the battery. The "buzz" was sufficiently dif- 
ferent from the sounds in the preceding test Ю 
evoke fresh interest and curiosity, Also, as one 
of the shorter tests, it was well suited to a perio 
in which most of the subjects were still “settling 
down.” 

ПІ. Music Box Test was largely а test of 
brightness discrimination, Round music boxes © 
equal size were used, one white, one black, 016 
gray, one black and gray, and one gray ale 
white, The subject was shown, in pretest demon” 
strations, that only the gray box “made music. 
Selection of the gray music box, made in any 
unambiguous way at the disposal of the subject, 
was the only response required. There were SÎX 
items. The gray box was placed in a different 
position for each item, and the other boxes were 
added in increasing numbers. Larger, rectangula" 
boxes, comparable in other respects to the тош" 
ones, were used for the transfer form. ae 

This was the longest test in the battery. It т 
found to be most satisfactory when giyen „е 
middle place, at a time when the subjects Y. 


Мы dd 
usually sufficiently “settled,” and when restles* 
tended to be low, 


T 
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IV. Light Test, primarily a test of shape dis- 
crimination, consisted of a gray box with a 
plastic-covered window in each side. Knobs of 
various shapes appeared on each oí the sides, 
arranged in increasing numbers. The subject was 
Shown that only the rectangular knobs, one oí 
which appeared on each side of the box, lit up a 
series of flashlight bulbs placed behind the win- 
dows. A correct response required the selection 
of this knob, made by the subject in any way 
possible to him, The test included eight items, 
each of the sides of the box being presented twice. 
А second gray box, with black knobs, correspond- 
ing in shape to the original white ones, constituted 
Ше transfer form. 

As one of the shorter tests, this was chosen to 
follow the longest one in the battery. It was per- 
haps the best-liked of the tests, and was well 
suited to overcome developing restlessness and to 
restore flagging interest toward the end of the 
battery. 

V. Three-Buttons Tesi, a second test of bright- 
ness discrimination, involved sorting white, gray, 
and black buttons. The examiner dropped one 
sample button into each of the three transparent 
containers. The subject was then requested to 
indicate the box which contained the sample cor- 
responding to each of the buttons presented to 
him. The test contained six items, the buttons 
being presented in increasing numbers. In the 
transfer form, the positions of the sample buttons 
were reversed, and the subject was required to 
place the buttons according to the new pattern. 

This test was placed at the end of the battery 
largely because a marked tendency toward per- 
Severation resulted unless the two "buttons" tests 
were widely separated. Of the two, this one re- 
quired less time to administer, and made a short, 
interesting final test. 


Test Administrations 


The final battery was administered nine 
times. Administrations А and B measured 
pre- and posttraining performance, respec- 
tively. The transfer forms were used in 
Administration С. These three administra- 
tions were completed for each test before 
the next one was begun. The subject was 
returned to his teacher or parent for the 
interval of approximately an hour which 
elapsed before the next administration was 
given. 

Administration D measured retention 
without additional reinforcement. Pretest 
demonstrations and instructions were omit- 
ted, training was not repeated, and transfer 
forms were not used. Intervals of more 
than an hour for this method of retesting 


were found in the pilot studies to produce 
scores which were too low for purposes of 
discrimination. All of the subjects received 
the first four administrations of the battery. 

Administration E was given approxi- 
mately one week after the original testing. 
Training periods and the use of the transfer 
forms were again omitted. However, pre- 
test demonstrations and instructions were 
included, since it was found that after an 
interval of a week the majority of the sub- 
jects in the pilot studies no longer remem- 
bered what was required without this 
amount of reinforcement. Of the original 
sample, 102 subjects were available for Ad- 
ministration Е. 

Administrations F, G, H, and I were 
given approximately seven weeks after the 
first testing period. They were repetitions 
of Administrations A, B, C, and D, respec- 
tively. Training was again included, and 
both forms of the tests were used. Thirty- 
six of the original subjects were available 
for the final four administrations of the 
battery. 

Administrations A, B, and C of the bat- 
tery, and the comparable administrations in 
the later retesting, required an average of 
about 40 minutes to complete. Administra- 
tions D and I could usually be given in a 
quarter of an hour. Ап average of about 
20 minutes was needed for Administra- 
tion E. 


General Testing Procedures 


Testing in the pilot studies was done by the 
investigator. Qualified, advanced graduate student 
testers administered the final battery. They were 
given a manual, and were supervised by the in- 
vestigator in administering the tests with at least 
10 subjects. These scores were not included in 
the results. 

Subjects were tested individually, in the school 
or clinic which they attended. The number of 
sessions used to complete the battery depended on 
the needs of the particular child. Rest periods 
were sometimes necessary during the first three 
administrations. They were allowed only between 
the completion of Administrations A, B, and C 
of a particular test, and the beginning of the next 
one. If more than one session was necessary, the 
time intervals between the sessions were kept 
parallel when the child was retested. In most 
cases, however, the first three administrations 
were completed in a single session, 
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The child's handedness was determined in ad- 
vance, if possible. In the relatively few cases 
where this was not possible, right handedness was 
assumed. Test items were presented toward the 
child's favored hand, the middle point of the item 
corresponding roughly to the midline of the 
shoulder on that side. 

Since the examiner manipulated the materials 
for the child, if necessary, the subjects were given 
equal opportunities to see the completed responses, 
and to make or indicate spontaneous corrections. 
All responses, both in testing and training, were 
corrected or correctly completed by the examiner, 
so that the subjects had equal opportunities to see 
the correct response the same number of times. 


General Training Procedures 


lt was found in the pilot studies that training 
the children to both the correct responses and to 
the avoidance of incorrect ones tended to be too 
confusing. Therefore, for the final battery, the 
subject was trained to the correct responses only. 
In order that the majority of the subjects would 
succeed in the training periods, extremely simple 
procedures were followed, 

In general, training consisted of demonstrations 
of the correct responses by the examiner; imita- 
tion of this demonstration by the subject, with 
cues supplied by the examiner; and prevention 
or immediate correction of errors. Most of the 
subjects were able to achieve a number of correct 
responses under these conditions. The items were 
presented for training in the same order in which 
they were given in the tests. The subject was 
trained to a maximum of three times on each 
item, beginning with the first item failed on the 


test. Exceptions to these proced: T 
Appendix A, procedures are noted in 


Scoring Procedures 


The subject was given one point for each cor- 
rect response. Scoring booklets contained one 
page for each test, on which each administration 
was scored separately. To minimize possible con- 
tamination in scoring, the original booklets were 
taken from the testers after the completion of 
Administration E. Also, the testers were not 
originally informed that any further testing was 
anticipated at that point. New scoring booklets 
were used for Administrations F, G, H, and I. 


Teachers’ Ratings 


Ninety-nine of the subjects attended special 
schools for mentally retarded children, Teachers’ 
ratings were obtained for this group, from the 
child's current teacher in whose class he had been 
for a period of at least one month. A five-point 
rating scale was used, on which the teacher was 
asked to rate the child's learning ability, as com- 
pared with that of the other children who at- 


tended that school. The various populations of 
the different training schools were thought to be 
sufficiently similar to warrant the use of this 
frame of reference. 


Resutts* 
Test-Retest Comparisons 


In order to determine the stability of the 
battery, product-moment correlations were 
computed between the scores obtained on 
comparable administrations of each test, by 
the 36 subjects who were still available for 
retesting after an interval of seven weeks. 
The coefficients, all of which were highly 
significant, are shown in Table 1. 


TABLE 1 


ТЫ 
Propuct-Moment CORRELATION COEFFICIENTS 
OBTAINED ву TEsr-RETEST COMPARISONS 


(N — 36) 

Adminis- 'Test | Test | Test | Test Test 
trations I Ira MELTS ave) У 
Aand F .66 | .82 | .85 | .79 | .80 
Band С .93 | .95 | .94 | .92 | .95 
Cand H .93 | .90 | .91 | .92 | .96 
Dand I 95 | .90 | .95 | .92 | .95 


* All significant at .001 level. 


It can be seen that the coefficients PFO” 
duced by the pretraining administrations © 
the tests (Administrations А and Е) were 
always lower than those obtained betwee” 
comparable learning scores. The former 
ranged from .66 to .85 and the latter from 
.90 to .96, with no overlap between the 
ranges. The likelihood that the A-F co- 
efficients would be consistently lower than 
those between the learning scores is exceed- 
ingly small (р < .01) by the Mann- 
Whitney U test (Siegel, 1956). 


: * Statistical tables, referring to the original 
investigation and entitled Supplement to Meast iy 
ment of ihe Educability of Severely Ме. 
Retarded Children (Schucman, 1959), are avt 
able on interlibrary loan from: Gould Mem?'z5, 
Library, New York University; New York 
New York. 
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Intertest and Intratest Comparisons 


То determine the intertest consistency of 
the battery, product-moment correlations 
were performed between the scores from 
the same administrations of the different 
tests, i.e., scores from Administration A of 
each test were compared with scores from 
Administration A of every other test, and 
so оп. The 102 subjects who remained 
available for Administration E were used 
in these analyses, which were based on 
scores from Administrations А. ВС, D; 
and E of each test. The correlations thus 
resulted in 50 coefficients, 10 for each of the 


administrations. 
All of the coefficients reached the .001 


level of confidence. However, those yielded 
by the pretraining scores, which ranged 
from .31 to .63, were significantly lower 
than those produced by the 40 comparisons 
between the learning scores, which ranged 
from 2 to .87, with no overlap between 
the ranges (р < .01 by the Mann-Whitney 
U test). 

То determine the intratest consistency of 
the battery, product-moment correlations 
were also computed between the scores ob- 
tained from different administrations within 
the same tests, ie. between scores from 
Administrations A, B, C, D, and E of each 
test, using 102 subjects. Fifty coefficients, 
10 for each of the tests, were thus obtained. 

Again, all of the coefficients were signifi- 
cant at the .001 confidence level. However, 
the 20 coefficients obtained between the 
pretraining and the various learning scores 
within each of the tests were consistently 
lower than those produced by the 30 corre- 
lations of the various measures of learning 
performance. The former ranged from .50 
to .89 and the latter from .86 to .97, with 
overlap between the ranges occurring in 
only one instance (р < .01 by the Mann- 
Whitney U test). 


Comparisons of Scores Obtained 
on Different Administrations 


In order to determine whether the dif- 
ferent administrations of the battery had 
produced significantly different results, a 
series of subjects X conditions analyses of 


variance (Edwards, 1950) were performed, 
comparing pretraining, posttraining, trans- 
fer, and retention performance of all of the 
subjects, as obtained on Administrations A, 
B, C, and D of each test. 

The results demonstrated that highly 
significant differences were obtained on 
every test, the Fs ranging from 94.90 to 
163.97 (df = 3, 339). With a single excep- 
Поп (between posttraining and retention 
scores on Test III), the /s between the 
means, which ranged from the .05 to the 
001 levels of confidence, showed that 
significant differences were found between 
the scores from all of the administrations 
of all of the tests. The mean scores obtained 
on the various administrations of each of 
the tests are shown in Figure 1. (In com- 
paring the mean scores from the different 
tests, it should be borne in mind that Test 
IV contained eight items, while the others 
included six). 

A second series of analyses of variance 
was performed between the scores obtained 
on comparable administrations by the 36 
subjects who received all of the administra- 
tions of the battery. The Ёз between ad- 
ministrations ranged from 12.16 to 29.44 
(df — 7, 245), all significant at the .001 
level of confidence. The ts between the 


(№114) 


Mean Scores 
591025 UDOW 


Administrations 


Fic. 1. Differences in mean scores obtained on 
various administrations. 
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means obtained on the different administra- 
tions demonstrated the following. 


Except on Test 1, the scores produced by 
Administration F were significantly higher 
than those from Administration A, with ts 
from the .05 to the .001 levels of confidence. 
Administration F always produced signifi- 
cantly lower scores than Administration B, 
with highly significant #5. In three out of 
five instances, scores from Administration 
F were also significantly lower than those 
from Administration D. It was also found 
that similar learning curves were obtained 
across time by Administrations A, В, C, 
and D on the one hand, and Administra- 
tions F, G, H, and I on the other. 


Effect of Population Variables 
on Test Performance 


. The influence on test performance of five 
independent variables, namely, sex, time 
spent in school, age, diagnostic category, 
and intelligence, was analyzed. Estimated 
IOs were used as the measure of the sub- 
jects’ intelligence, since no one standard in- 
telligence test could be validly administered 
to all of the subjects. Estimated IQs were 
thought to be the most suitable measure of 


intelligence for the group for the following 
reasons. 


Estimated 105, Vinel 
ratings, and Stanford-B 
tainable Were intercorrelated (7), resulting 
in highly significant coefficients which 
ranged from .54 to .92. However, estimated 
IQs were found to have the closest relation- 
ships with all of the other measures, cor- 
relating .92 with Stanford-Binet IOs, .82 
with teachers' ratings, and .81 with Vine- 
land SQs. To estimate the reliability of the 
measure, two estimated IQs were obtained 
for 11 subjects who had been evaluated in- 
dependently by two different psychologists 
within the year, once at a clinic and once at 
a school. The product-moment correlation 
computed between these estimates yielded a 
coefficient of .87 (р < .001). 

Before analyzing the effect of each of the 
five population variables on test perform- 
ance, multiple correlations (R) were per- 


and SOs, teachers' 
inet IOs where ob- 


formed, to determine the extent to which 
these factors together accounted for the 
variation in the scores. All of the subjects, 
and scores from Administrations A, B, C, 
and D of each test, were used in these 
analyses. 

The coefficients, all highly significant, are 
shown in Table 2. It can be seen that the 


TABLE 2 
MULTIPLE CORRELATION COEFFICIENTS* 


BETWEEN THE FIVE INDEPENDENT VARIABLES 
AND ADMINISTRATIONS OF EACH TEST 


(N = 114) 
= —= 
Adminis- | Test | Test | Test | Test к, 
stration | 1 II III IV 
| eem 
A .12 | .72 | 48 | .67 | -78 
Em 
B .88 | .87 | .86 | .86 | -87 
EE. 
e .90 | .85 | .86 | .85 | .85 
D .83 .84 .84 .86 .86 
* All significant at .001 level. 


coefficients between the pretraining scores 
and the independent variables, which ranged 
from .48 to .78, were always lower tha? 
those between the learning scores and the 
independent variables, which ranged from 
83 to .90, with no overlap between the 
ranges (р < .01 by the Mann-Whitney 
test). 

Expressed as coefficients of determination 
(К), the above results demonstrate that 
the factors of sex, time in sc 
nostic category, and intelligence together 
accounted for 23 to 61% of the variation in 
pretraining performance, and 69 to 80% of 
that obtained in the learning scores, 

. The influence of each of the five popula- 
tion variables on the scores obtained fron! 
the first five administrations of each test 
was analyzed by a series of "repeated meas 
urements" analyses of variance (Edwards 
1950). Groups were proportionalized 5^ 
that nonsignificant differences existed PÊ 
tween them with regard to each of the a 
population variables not under conside"? 


poe 
hool, age, diag 
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tion at the time. Chi square and f tests 
were used to test the significance of these 
differences. 


Effect of Sex. Analyses for determining 
the influence of the factor of sex on the 
test scores were based on 24 boys and 24 
girls. The results demonstrated that this 
variable did not significantly affect perform- 
ance on any of the tests, either as а main 
effect or in interaction with the administra- 
tions. 

Effect of Time in School. The three 
levels of time spent in school which were 
used in the analyses for the effect of this 
variable on the test scores were based on 
the length of time of the subjects’ school 
attendance. Nine months were counted for 
each actual year, the vacation months being 
excluded. 

Subjects with little or no schooling were 
assigned to the lowest level. There were 28 
subjects in this group, who had attended 
school from 0 to 2 months, with a mean of 
1.04. The middle level consisted of 26 sub- 
jects, whose school attendance ranged from 
3 to 13 months, mean 7.58 months. The 21 
subjects in the highest level had attended 
school for relatively long periods, ranging 
from 14 to 33 months, mean 21.53. 

The results of the analyses demonstrated 
that the factor of time spent in school did 
not significantly influence performance on 
any of the tests, either alone or in inter- 
action with the administrations. 


Effect of Age. The levels used for the 
variable of age corresponded to three gen- 
eral levels of school readiness for this type 
of population. The groups were somewhat 
older than is customary with “normal” chil- 
dren at corresponding levels, which is usu- 
ally the case with severely mentally retarded 
subjects. 

The youngest level corresponded to a 

reschool group, and consisted of 37 sub- 
jects. aged 61 to 83 months, with a mean of 
71.00 months. The middle level constituted 
а beginning and early school group, and 
contained 36 subjects whose ages ranged 
from 84 to 112 months, mean 96.39. The 
21 subjects in the oldest level were ready, 
on the basis of age. for more sustained and 


concentrated training. Their ages ranged 
from 115 to 143 months, mean 126.81. 

Analysis of variance demonstrated that 
age did not significantly affect performance 
on Tests I, IV, or V, and no significant 
interaction effect was found on any of the 
tests. However, the Fs between the age 
levels on Tests II and III were both sig- 
nificant at the .05 level of confidence, show- 
ing that the factor of age exerted a sig- 
nificant effect on the scores of each of 
these tests considered as a whole. The be- 
tween-level ts (р < .05) demonstrated that 
the mean score of the youngest group was 
significantly lower than that of the middle 
or oldest levels on both of these tests. No 
significant differences were obtained be- 
tween the scores of the middle and oldest 
groups, or between the age levels on spe- 
cific administrations. 


Effect of Diagnostic Category. Levels 
used for the variable of diagnostic category 
were based on the extent of the child's 
handicaps and the impairment of his gen- 
eral level of functioning. The 55 brain- 
injured subjects (ie, children who had 
suffered trauma to the brain) were assigned 
to the lowest level. They were thought to 
be the most handicapped in the sense that 
various types of physical handicaps were 
present in addition to the mental retarda- 
tion. Mongoloids were placed at the high- 
est level, which included 17 subjects. They 
were considered to be the least handicapped 
in that, as a group, their motor coordina- 
tion was relatively good, and their behavior 
comparatively stable. The middle level con- 
sisted of 21 subjects in various diagnostic 
categories occurring in insufficient numbers 
to be treated as separate groups, and whose 
medical records and life histories lacked 
evidence suggestive of trauma to the brain. 

The analyses demonstrated that the fac- 
tor of diagnostic category did not signifi- 
cantly influence the scores obtained on Tests 
I, II, III, or V. There was no significant 
interaction effect on any of the tests. How- 
ever, an F significant at the .05 level of 
confidence on Test IV showed that this 
variable affected performance on the test 
asa whole. The between-level ts (p < .05) 
demonstrated that the mean score of the 
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mongoloids was significantly higher than 
those of the other two groups. No signifi- 
cant differences between the levels of diag- 
nostic category were obtained within spe- 
cific administration. 


Effect of Intelligence. Three levels were 
used for the factor of intelligence, based on 
the division of the total range of estimated 
IQs approximately into thirds. The lowest 
level contained 21 subjects, whose esti- 
mated IQs ranged from 8 to 25, mean 
15.95. The middle level, which consisted of 
31 subjects, ranged in estimated IQs from 
27 to 39, mean 34.10. The estimated IQs 
of the 50 subjects assigned to the highest 
level ranged from 40 to 55, mean 46.40. 

Analysis of variance clearly demonstrated 

that the factor of intelligence, both alone 
and in interaction with the administrations, 
exerted a highly significant influence on the 
scores of every test. These results are 
shown in Figure 2, which presents the 
mean scores obtained by the three levels 
of intelligence on the five administrations 
of each of the tests, 
_ The Fs obtained between the levels of 
intelligence, all of which were highly sig- 
nificant, ranged from 68.62 to 121.92 (df 
= 2, 99). The between-level ts, all sig- 
nificant at the .001 level of confidence, 
showed that the high intelligence group per- 
formed Significantly better than the low or 
middle levels, while the low group was 
significantly inferior to the middle level on 
every test considered as a whole. 

Further, the ts between the intelligence 
levels on specific administrations, signifi- 
cant at the .001 confidence level in every 
case, demonstrated that the mean scores 
of the high intelligence group were signifi- 
cantly greater than those of the low group 
in all types of learning performance. Also, 
the low group was always significantly in- 
ferior in learning performance to the mid- 
dle group, and, with one exception, the 
middle intelligence level was significantly 
inferior to the high group. 

It can therefore be said that significant 
differences were found between the per- 
formance of the three levels o£ intelligence 
in 59 of the 60 comparisons based on learn- 


ing scores. On the other hand, the #5 for 
pretraining performance, which were al- 
ways lower than those obtained for the 
learning scores, failed to reach statistical 
significance in 4 of the 15 comparisons. 
Of particular interest was the highly sig- 
nificant interaction between the levels of 
intelligence and the test administrations, 
which was found on all five tests. The in- 
teraction Fs ranged from 7.01 to 20.95 
(df = 8, 396). The within-level ts showed 
the following differences in the perform- 
ance of the different levels on the various 
administrations of the tests. | 
The only significant differences found in 
the scores of the low intelligence level were 
obtained between pretraining and posttrain- 
ing scores on four of the five tests. The 
group failed to transfer the training and 


Не —— . Testi 


Fic. 2. Mean scores obtained at different level 
of intelligence 
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to retain the learning to a significant extent 
on any test. 

For the middle intelligence level, all mean 
scores obtained for learning performance 
were significantly higher than those from 
pretraining administrations (р < .001). 
Also, significant differences were found in 
17 of the 30 comparisons made between 
the various learning scores obtained by this 
group. 

Highly significant differences were al- 
ways found between pre- and posttraining 
scores for the high level of intelligence. 
The ts between the means were always 
larger than those for the middle intelligence 
level, with no overlap between the ranges 
(р < .01 by the Mann-Whitney U test). 
Among the different types of learning 
scores, highly significant differences were 
obtained in 21 of the 30 comparisons. 


Still another demonstration of the sig- 
nificantly greater influence of the factor of 
intelligence on learning as opposed to pre- 
training performance was found in coeffi- 
cients of determination based on multiple 
correlations (R?). All of the subjects, and 
Scores from Administrations А, В, C, and 
D of each test, were used in these analyses. 

The results demonstrated that the vari- 
able of intelligence accounted for 18 to 
559% of the variation in pretraining per- 
formance, and 62 to 7696 of the variation 
obtained in the learning scores, with no 
overlap between the ranges (р < .01 by 
the Mann-Whitney U test). 


Comparisons of Test Scores with 
External Criteria 


The intercorrelations of estimated IQs, 
teachers’ ratings, Vineland SQs, and Stan- 
ford-Binet IQs have already been men- 
tioned in the section devoted to the rela- 
tionships of estimated IQs with the other 
measures. The coefficients were all signifi- 
cant at the .001 level of confidence, demon- 
strating that the criteria were not unrelated. 
In addition to those already reported, the 
correlations produced a coefficient of .61 for 
Vineland SQs and Stanford-Binet TQs, and 
teachers’ ratings correlated .72 with Vine- 


land SQs and .54 with Stanford-Binet IQs, 


the latter being the lowest coefficient ob- 
tained. 


To estimate the reliability of the teach- 
ers’ ratings, a product-moment correlation 
was performed between independent ratings 
made by two teachers who had taught the 
same child for at least one month during 
the past year. Two such ratings were 
obtained for 25 of the subjects, and corre- 
lated .85 (p < .001). 

Product-moment correlations were com- 
puted between the scores from Administra- 
tions A, B, C, D, and E of each test, and 
teachers’ ratings, Vineland SQs, and psy- 
chologists’ estimated 105. Although the 
close relationship between estimated IQs 
and test performance was already demon- 
strated, this measure was again included 
here because of the added information sup- 
plied in this context. 

The analyses resulted in 75 coefficients 
(5 administrations X 5 tests X 3 criteria). 
Those coefficients ranged from .31 (р < 
.01) to .87, all except the first reaching the 
:001 level of confidence. With two excep- 
tions, both based on pretraining perform- 
ance, the coefficients between the test scores 
and estimated IQs were always higher than 
those obtained between the test scores and 
the other criteria. In 18 out of 25 instances, 
the relationships between teachers’ ratings 
and the test scores were greater than those 
between Vineland SQs and the test scores. 

The correlations of pretraining scores and 
estimated IQs yielded coefficients ranging 
from .44 to .77, while those obtained be- 
tween the learning scores and this criterion 
ranged from .70 to .87. Within each test, 
the coefficients based on learning scores 
were always higher than those obtained for 
pretraining performance. 

To determine the significance of these 
differences, ts were found for the differ- 
ences between the pairs of correlation coef- 
ficients, one of pretraining score with the 
criterion and the other of learning score 
with the criterion, as obtained for each 
test (Guilford, 1956, pp. 193f.). The results 
showed that the relationship between learn- 
ing performance and estimated IQs was 
always significantly greater than that be- 
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tween pretraining performance and this 
criterion. 


The correlations between teachers! rat- 
ings and the test scores produced the fol- 
lowing results. The coefficients based on 
pretraining scores ranged from .47 to .72, 
and those obtained between the learning 
Scores and this criterion ranged from .67 
to 8. Again, the latter were always 
greater than the former within each test. 
The ts between these differences demon- 
strated that the relationships between the 
learning scores and teachers' ratings were 
significantly greater than those between 
initial test performance and this criterion 
in 16 of the 20 comparisons. 

Similar results were obtained by the cor- 
relations of Vineland SQs and the test 
Scores. The coefficients for initial test per- 
formance and the criterion ranged from .31 
to .70, while those obtained for the learn- 
ing Scores ranged from .64 to .76, with no 
overlap within the Separate tests. The [5 
performed between the coefficients showed 
that the learning scores were more signifi- 
cantly related to Vineland SQs than were 


the Pretraining scores in 15 instances out 
of 20. 


Discussion 
Pilot Studies 


Although the imm 


: ediate purpose of the 
pilot studies was to E te 


ў increase the adequacy 
ОЁ the tests in meeting the criteria estab- 


lished for them, а number of ancillary 
findings Were obtained from the prelimina: 
experimentation which suggest further Te- 
search in two main directions, The first is 
the possible extension of the method of 
testing used in this study to other groups 
of children. The following findings are 
relevant in this connection, 

Several tests in the experimental battery, 
and some of the earlier forms of the final 
tests (see Appendix B), were found to be 
too difficult for the sample used in the pres- 
ent study, and were therefore not used in 
the collection of the final data. However, 
they were administered experimentally to 
10 moderately retarded subjects aged 5-8 


to 11-1, with IQs on the Revised Stanford- 
Binet Intelligence Scale ranging from 51 
to 74. 

Though the number of subjects was too 
small to warrant generalizations, the ob- 
tained results suggest that these tests are 
suitable for evaluating learning abilities in 
moderately retarded subjects, and may pro- 
vide the basis for an upward extension of 
the present battery. . 

On the other hand, some of the tests in 
the original battery, as well as some of the 
early forms of the final tests, were found 
to be unsuitable for the present study bgs 
cause they were too easy for the subjects. 
These tests were tried with eight severely 
retarded children, aged 3-6 to 4-11. Al- 
though the number of subjects was again 
very small, the results indicate that these 
tests can be used for severely retarded chil- 
dren below the age levels used in this study: 
and may be a basis for a possible age scale. 

Of considerable interest are the results 
obtained by 10 subjects diagnosed as schizo- 
phrenic on the basis of medical and psychi- 
atric records. These subjects were among 
a group of 24 children, who had already 
been evaluated by a psychologist but whose 
medical records were not yet available, АП 
of these children were given the final bat- 
tery, and the results were set aside pend- 
ing the arrival of the medical records. 

Analyses of the scores showed that 11 of 
the subjects had performed markedly bet- 
ter on all of the tests than did the remain- 
der of the group, whose mental levels were 
comparable. When the medical reports аг- 
rived, it was found that 10 of the 11 sub- 
Jects were diagnosed as schizophrenic. I 
is possible that tests such as these шау 
provide a method for differentiating the 
So-called “pseudoretardate” from the men- 
tally retarded child functioning at а com- 
parable mental level. 

Results obtained on 
ultimately rejected as un 
injured subjects are also 
Preliminary analysis of 


tests which were 
suitable for brain- 
of special interest- 
the limited results 
obtained in this area suggest that the brain- 
injured children were more sharply differ 
entiated from the non-brain-injured sub- 
Jects by their learning performance than by 
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their initial test scores. It may be that 
learning behavior, as measured on tests 
such as these, is a better indicator of the 
presence of brain-injury than are single 
administrations of a test or tests. 

The second area which warrants further 
investigation, on the basis of results of the 
pilot studies, is the possible application of 
the improvements made in the tests during 
the preliminary experimentation to curricu- 
lar materials and training methods in the 
classroom. 

Among the chief difficulties which arose 
in the development of the battery were the 
problems of increasing motivation for learn- 
ing, improving the procedures for train- 
ing, and minimizing the tendency toward 
perseveration. А number of successive 
changes were introduced into the test dur- 
ing the pilot studies, which progressively 
increased the efficiency of the tests in these 
respects. The specific set of variables re- 
sponsible for these improvements cannot 
be identified at this time. However, the 
results are suggestive, and may have direct 
applicability to the training of severely 
mentally retarded children in the classroom. 
The findings are as follows: 

1. Simple sensory rewards, such as the 
sound of bells, buzzers, and chimes, and 
the turning on of lights, were the most 
satisfactory devices for increasing motiva- 
tion. 

2. These rewards could be built into the 
materials themselves, or rewards should be 
supplied by the examiner verbally as the 
child performed. А combination of both 
was the most successful. 

3. The positive effect of these rewards 
lasted for relatively short periods. There- 
after, the introduction of another such re- 
Ward was necessary to reawaken interest 
and maintain it for another brief period. 

4. The factor of novelty appeared to be 
more important in the rewards than was 
the particular sense modality involved, i.e., 
substituting one sound for another was as 
effective as changing from an auditory to 
а visual reward. 

5, The use of external rewards, such as 
offers of toys or candy, was often helpful 


in establishing and maintaining rapport, but 
was not found to be a significant factor in 
motivating performance on the tests. 

6. The usual testing practices (smiling, 
nodding, and so on) were highly unsatis- 
factory. An extremely active and constant 
participation on the part of the examiner, 
on the other hand, was a highly facilitating 
factor. 


7. The participation of the examiner was 
most effective in the form of clapping, 
laughing, singing, and patting the child. 

8. Training to the correct responses only 
was considerably more productive than 
training to both selection of the correct 
responses and avoidance of the incorrect 
ones. The inclusion of both tended to con- 
fuse the subjects. 


9. When it was necessary to demonstrate 
to the subject that certain of the materials 
were nonfunctioning, it was essential to re- 
inforce the correct response immediately 
thereafter. 

10. The most suitable training methods 
were those in which all procedures were re- 
peated without variation. Whether this 
would be so in longer training periods is 
not known. However, it can be said that the 
most effective combination for facilitating 
learning in the short training sessions used 
for the tests was that of constant changes 
in the materials, without the introduction 
of change into the training methods which 
were used with each of them. 

11. Perseveration was minimized when 
the pattern of the required responses was 
not changed, and only the number of the 
responses was increased. The introduction 
of even minimal changes into the pattern 
tended to result in an increase in persevera- 
tion. 

12. Items which required choosing among 
materials so arranged that the increment 
in size or degree of brightness between ad- 
jacent materials was constant tended to in- 
crease perseveration. 

13. The tendency to perseverate was 
minimized when dissimilar objects were 
placed next to each other, and also when 
the test items were arranged so that the 
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selection of adjacent materials was not re- 
quired. 


14. Perseveration was apt to occur when 
training began with more complex items. 
Under these circumstances, the subjects 
were often unable to return successfully to 
the simpler responses which they had previ- 
ously given correctly. 


15. Single demonstrations of one or two 
of the simpler items, before undertaking 
training on the more complex ones, greatly 
facilitated a later return to the simpler items 
previously passed. 


Adequacy of the Final Battery in 
Meeting the Test Criteria 


The tests were apparently within the 
abilities of the subjects since, as a group, 
they obtained significantly higher scores 
after training, and were able to transfer 
the training and to retain the learning to 
a highly significant extent on every test. 
Tt can also be said that the tests were able 
to discriminate among the subjects, since 
children at different levels of ability pro- 
duced significantly different scores. 

_ The behavior of the subjects in the test 
situation clearly pointed to the adequacy of 
MD in arousing and maintaining 

t est. These qualitative observa- 
tions were supported by the analyses of the 
quantitative results. The inter- and intra- 
test comparisons demonstrated a high de- 
gree of consistency in the performance of 
the group, and a definite, consistent, and 
stable pattern was found in all of the learn- 
ing curves. Such findings would hardly be 
likely to occur in the face of lack or loss 
of interest. 

Although the effect of brain-injury on 
test performance requires further study, 
the obtained results suggest that the battery 
was essentially suitable for these children. 
The brain-injured group succeeded in mak- 
ing significant gains in immediate learning, 
and was able to transfer and retain the 
learning to a significant degree. Further, 
children with no speech and with severe 
motor handicaps were able to respond ade- 

quately. 


The obtained results are also thought to 
show that the tests depended chiefly on 
abilities sufficiently well developed at the 
age levels of the group. Although a limited 
effect of the variable of age was found on 
two of the tests, significant learning gains 
were obtained by the different age levels 
on all of them. 

Adequacy in meeting the requirement of 
minimal dependence on past experience can 
only be inferred. Perhaps the most directly 
relevant finding was that the factor of time 
in school, which ranged from none through 
five years, did not significantly influence 
performance on any of the tests. . 

Differential experiences аге necessarily 
involved in a range of five years of school 
attendance. Further, school experiences ате 
of a kind which might a priori be believed 
to be the most likely to influence perform- 
ance on tests such as these, especially Since 
the materials and procedures were not Un- 
like those used in the training schools. It 
can therefore be said that precisely those 
kinds of experiences which might reasona- 
bly be expected to exert a differential in- 
fluence on test performance failed to do so. 

In brief, it is thought that the results 
demonstrated that the battery met the cri 
teria reasonably well. The tests were ар” 
propriate to the range of abilities and the 
age levels studied, succeeded in arousin& 
and maintaining the interest of the children: 
and were suitable for subjects with severe 
motor and verbal handicaps. 


Effects of Population Variables on 
Test Performance 


No significant influence of the factors 
of sex or time spent in school was foun 
on the scores obtained from any of the 
tests. Apparently the battery was suitable 
for either boys or girls, and for children 
with school experience ranging from попе 
to five years. The variable of age exerte 
а limited effect on the scores of some О 
the tests, its influence being restricted e) 
the performance of the youngest subjects: 

No significant influence of the factor ° 


diagnostic category was obtained for. 
majority of the tests, and only a limit? 
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effect of this variable, restricted to the per- 
iormance of the brain-injured subjects, was 
found at all. However, these results were 
to some degree a function of the levels into 
which the group was divided for the anal- 
yses in which the findings were obtained. 

It is not thought that the levels of diag- 
nostic category which were used in the 
present study were adequate for a thor- 
ough investigation of its possible effects. 
Although no rationale for a meaningful 
subdivision was found with this sample, the 
use of a brain-injured group as such was 
open to serious question, since brain-injury 
is a highly heterogeneous phenomenon. Ad- 
ditional study is also needed in connection 
with the various categories assigned to the 
miscellaneous group. 

Intelligence was the only factor which 
was found to be highly related to perform- 
ance on every test. Of particular interest in 
this connection are the results which dem- 
onstrated that a closer relationship was al- 
ways obtained between intelligence and 
learning performance than between intelli- 
gence and initial test performance. The 
factor of intelligence not only exerted a 
highly differential influence on pretraining 
as opposed to learning performance, but 
also on the various types of learning scores 
obtained, i.e., immediate learning, transfer, 
and retention. 

While subjects characterized by the low- 
est level of intelligence made significant 
gains in immediate posttraining learning on 
most of the tests, they did not succeed in 
transferring the training or in retaining 
the learning on any of them. Since the 
deficiencies of these subjects were most 
clearly demonstrated in the latter areas, 
it can be said that transfer and retention 
Scores were the most sensitive of the learn- 
ing measures to differences in ability. 

In contrast to the performance of the 
subjects at the lowest level of intelligence, 
the middle intelligence group not only 

ained in response to training, but was also 
able to transfer the training and to retain 
the learning to a highly significant extent 
on every test. While the high intelligence 
group also made significant gains in the 
three learning areas, the magnitude of their 


mean scores was always higher than that 
obtained by subjects in the middle intelli- 
gence group. 

Significant differences in the learning per- 
formance of subjects at different levels of 
intelligence were found in 59 out of 60 
comparisons. On the other hand, the mag- 
nitude of the differences in their respec- 
tive mean scores, as obtained by the pre- 
training administrations of the tests, was 
always lower than those found in their 
learning scores. Further, pretraining per- 
formance failed to differentiate the groups 
to a significant extent in 4 out of 15 in- 
stances. 

These results carry the implication that 
for severely mentally retarded children, and 
perhaps for others as well, tests which are 
administered only once may not be the 
most sensitive measures of differences in 
ability. 


Reliability 


Test-retest comparisons demonstrated a 
high degree of stability for all of the tests. 
However, the learning scores on every test 
were found to be significantly more stable 
than the pretraining scores, with coefficients 
of .90 or higher in every case. The learn- 
ing scores were also found to be signifi- 
cantly more closely related than were the 
pretraining scores to other learning trials 
on the same test, and to the same trials on 
different tests. 

These findings suggest that initial test 
performance was responsive to more spe- 
cific factors, possibly including previous 
experience. On the other hand, the greater 
stability and consistency of the learning 
scores suggest that a more general factor 
was measured. 


Validity 

The obtained results consistently attest to 
the construct validity of the tests. The 
evidence on its behalf began with the dem- 
onstrated adequacy of the battery in meet- 
ing the criteria established for the tests, 
which was a precondition of validity, In 
order to measure a child’s abilities, he must 
be placed in situations which are appropri- 
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ate to his age and to his level of ability, in 
which he is highly motivated to respond, 
and in which he is capable of responding 
within his limitations. 

The basis for construct validity lay in the 
particular learning abilities which were 
studied. Educability can virtually be de- 
fined in terms of a child's ability to learn 
in response to instruction, to transfer the 
training, and to retain the learning. Schools 
are established in order to obtain the de- 
sired outcomes of learning, and “it might 
almost be said that if there is to be educa- 
tion there must be transfer" (Pressey & 
Robinson, 1944, p. 573). 

The particular areas in which the learn- 
ing abilities were investigated are also 
thought to contribute to the construct vali- 
dity of the battery, in that imitative abilities 
are important in the training of these chil- 
dren, and abilities to make shape, color, and 
size discriminations are emphasized in the 
curriculum. 

The shape of the learning curves, consist- 
ently obtained by the various tests and per- 
sisting across time, was also in close accord 
with the constructs. The lowest scores 
made by the group were always found in 
the pretraining administrations of the tests. 
dass Eae the highest scores were 

BI ên EAE 
y after train- 
ing, where one would assume that the train- 
ing would have its maximum effect. 

Also in accord with the constructs, the 

transter scores made by the group were 
significantly higher than initial scores and 
lower than immediate learning gains, indi- 
cating that the group successfully trans- 
ferred some of the training to other situa- 
tions. Finally, the retention performance 
of the group was significantly higher than 
the original scores and lower than post- 
training learning gains. Although the prac- 
tice effect which the use of the transfer 
forms may have introduced is not known, 
and although the possible influence of each 
retest on subsequent ones was not deter- 
mined, the learning was retained to a sig- 
nificant extent, without repetition of the 
actual training. 


The validity of the constructs would also 
require that the measurements be stable 
over time, and, if more general learning 
abilities were tapped, that a high degree of 
consistency among the learning scores be 
present. Further, if learning behavior, аз 
obtained on the tests, was related to actual 
differences in ability, the magnitude of the 
learning scores would be different at differ- 
ent levels of ability, and would increase as 
the level of ability increased. It would also 
be necessary that the learning scores be 
significantly more sensitive than the pre- 
training scores to these differences. These 
results were found on every test. m 

The results give evidence of predictive 
as well as of construct validity for the bat- 
tery. The specific power of the tests to 
predict the ability of the child to learn !n 
the classroom situation, which would be 
required if the battery were to be used for 
purposes of selection or for making actua 
educational decisions, remains to be demon- 
strated. Meanwhile, the obtained results 
offer indirect evidence that the tests would 
be useful for such purposes. This evidence 
was obtained in the comparisons of the test 
scores with the external criteria. Before 
these relationships are considered, the cri 
teria themselves will be briefly discussed. 

Teachers’ ratings constituted the closest 
approach to a criterion of educability P*' 
se in this study, in that they reflected the 
actual experience of the teachers with the 
children’s classroom learning over time. The 
ratings were made by teachers with spec? 
training and experience in the area of те!" 
fal retardation, who were personally fa- 
miliar with the children whom they rated. 
While subjective factors probably entere 
into their ratings, some amount of objec- 
tivity could reasonably be anticipated. The 
reliability of the ratings was indicated PY 
the highly significant agreement found i" 
the comparison between independent rating? 
of two teachers evaluating the learning per 
formance of the same child. 


A highly significant relationship was ао 
found between the teachers’ ratings а? 
psychologists’ estimated IQs, each [еше 
unaware of the others estimate. М!" 


a 
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this agreement may show that the two 
measures supported each other, it must be 
borne in mind that the psychologists’ rec- 
ords were available to the teachers, so that 
the extent to which the teachers were influ- 
enced by this information cannot be deter- 
mined. 

It can be said, however, that while mak- 
ing the actual ratings, the teachers reviewed 
their own longitudinal records of the child's 
performance in the classroom, and did not 
refer to the psychological records at that 
time. It can also be said that, although the 
ratings of the teachers were found to be 
significantly related to all of the other 
criteria, the agreement was not so close as 
to preclude the possibility of independent 
judgment. 

Psychologists' estimated IOs, on the other 
hand, were found to be very closely related 
to the scores of standard tests, especially 
to IQs on the Revised Stanford-Binet Scale 
where obtainable. This close relationship, 
of course, may well be due to the fact that 
the psychologists based their estimates very 
largely on psychometric data in the chil- 
dren’s records. 

However, since some amount of subjec- 
tivity was probably unavoidable in the esti- 
mates of the psychologists, the highly 
significant relationships between this meas- 
ure and the scores of standard tests were 
desirable. The reliability of the measure 
was indicated by the highly significant cor- 
relation between two such estimates made 
independently by different psychologists 
evaluating the same child. 

The Vineland Social Maturity Scale is 
a measure of “social competence,” which 
its author defines as "the fuctional ability 
of the human organism for exercising per- 
sonal independence and social responsibil- 
ity” (Doll, 1953, p. 12). The concept is 
directly relevant to the training of these 
children, since the curriculum places major 
emphasis on the development of traits which 
the construct reflects. This was the only 
standard test that could be used with all of 
the subjects. since a social age and a social 
quotient could be derived on the basis of 
data supplied by the parent, rather than 
bv the subject himself. 


The comparisons of the test scores with 
the external criteria demonstrated that the 
results obtained by the various administra- 
tions of the tests were significantly related 
to all of the criteria. However, a marked 
difference in the degree of the relationship 
was found for initial test performance on 
the one hand, and for the learning scores 
on the other. 

In all cases, the correlations between learn- 
ing performance and psychologists’ esti- 
mated IQs were significantly greater than 
those obtained between estimated IQs and 
initial test scores. In most cases, the rela- 
tionships found between learning perform- 
ance and teachers’ ratings were significantly 
greater than those between the pretraining 
scores and this criterion. Also in most 
cases, the learning scores were significantly 
more related to Vineland SQs than was 
initial performance on the tests. 


It can therefore be said that the learn- 
ing scores obtained were better predictors 
than were the initial test scores of intelli- 
gence, classroom learning, and social com- 
petence, as measured by estimated IQs, 
teachers’ ratings, and Vineland SOs. re- 
spectively. 


SuMMARY AND CONCLUSIONS 


The purpose of the study was to investi- 
gate a method for obtaining an objective, 
quantified, predictive measure of educabil- 
ity for the severely mentally retarded child. 
The method was designed to measure abil- 
ities оп which education depends, namely, 
to learn under training, to transfer the 
training, and to retain the learning. 

The subject was first given a battery of 
tests without training. He was then trained 
to the correct responses under standard 
conditions, and retested so that learning 
gains could be measured. Different forms 
of the tests were used to study transfer, 
and retention scores were obtained by re- 
testing the subjects at various later dates. 

Criteria established for the adequacy of 
the tests required that they be appropriate 
to the abilities and ages of the subjects; 
capable of arousing and maintaining their 


interest; and suitable for their verbal, 
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motor, perceptual, and experimental handi- 
caps. Areas selected for study were imita- 
tive abilities, memory, and gross shape, 
brightness, and size discrimination. 

A series of pilot studies was conducted 
with a preliminary battery which consisted 
of 11 tests. The chief aim of the pilot 
studies was to bring the tests increasingly 
into accord with the criteria. Tests were 
altered, substituted, accepted, or rejected 
on this basis. 

The sample consisted of 114 severely 
mentally retarded subjects, living in the 
community, and attending special classes 
and/or clinics for mentally retarded chil- 
dren. They suffered from various patho- 
logical physical conditions known to pro- 
duce mental retardation. Their estimated 
IQs (a measure made by the evaluating 
psychologist, based on quantitative and 
qualitative analyses of the child's psycho- 
logical records) ranged from 8 to 55, mean 
36.18, standard deviation 12.01. Тһеу had 
attended school from 0 to 36 months, mean 
11.52, standard deviation 10.84. Their ages 
ranged from 61 to 143 months, mean 95.16, 
Standard deviation 23.48. There were 69 
boys and 45 girls, 

The final battery, which consisted of five 
tests, was administered to the subjects nine 
times. The first three administrations 
yielded pretraining, posttraining, and trans- 
fer Scores, respectively. The fourth and 
fifth administrations measured retention 
under different conditions and at different 
times. The last four administrations were 
repetitions of the first four, given after an 
interval of seven weeks. 

The results demonstrated that the tests 
were adequate in meeting the criteria estab- 
lished for them. It was also shown that the 
battery was highly stable. However, as 
compared with pretraining performance, 
the learning scores (1.е., posttraining, trans- 
fer, and retention) were found to be sig- 
nificantly more stable and significantly more 

consistent both within and across the tests, 
suggesting that they measured a more gen- 
eral factor. 

Comparisons of the scores on different 
administrations of the battery demonstrated 
that the group as a whole learned, trans- 


ferred the training, and retained the learn- 
ing to a highly significant extent on all of 
the tests. It was also shown that the factors 
of sex and time spent in school did not 
significantly influence performance on the 
battery. Age and diagnostic category were 
found to have a limited effect on the scores 
of some of the tests. 

The variable of intelligence was demon- 
strated to affect the scores of all of the 
tests, exerting a highly differential influence 
both on pretraining as opposed to learning 
performance, and also on the various kinds 
of learning performance measured. Trans- 
fer and retention scores were found to be 
the most sensitive measures of differences 
in ability, suggesting that tests which are 
administered only once may not be the most 
discriminating indicators of these differ- 
ences. 

Evidence of construct validity was ob- 
tained in the results which showed that 
learning performance was more stable an 
more consistent than pretraining perform- 
ance, in the consistent and stable learning 
curved obtained by the group on different 
tests in close accord with the constructs. 
and in the significantly greater relationship 
demonstrated between the learning scores 
and level of ability than between pretratn- 
ing performance and ability level. 

The results also gave partial Четопз 8” 
tions of the predictive validity of the bet. 
tery. Learning performance, as measure 
by the tests, was found to be a better PTC 
dictor than pretraining test performance of 
intelligence, classroom learning, and social 
competence, as measured by psychologists 
estimated IOs, teachers' ratings, and Vine- 
land 505, 

Within the limitations of the study, the 
following conclusions are thought to De 
warranted : 


l. The battery was highly stable and 
consistent, 

2. Learning performance was significant" 
ly more stable and more consistent tha? 
pretraining performance. 1 

3. The factors of sex and time in scho? 


did not significantly affect performance o 
the battery. 
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‚ 4 Age and diagnostic category had a 
limited effect on the scores of some of the 
tests. 

5. Intelligence exerted a highly signifi- 
cant influence on the scores of all of the 
tests. 


6. Transfer and retention performance 
were the most sensitive measures of differ- 
ences in ability. 

7. Evidence of construct validity and 
partial demonstrations of predictive validity 
for the battery were obtained. 
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APPENDIX A 


MATERIALS 


Two-Butions Test 


Materials. Two clear plastic boxes, 3.5 
inches high, with sides 3.5 inches along the 
bottom, and 4 inches along the top. The boxes 
were attached, 3.5 inches apart, to a black board 
10.5 inches long, and 3.5 inches wide. 

„Eight round white buttons, 1.75 inches in 
diameter, 

Testing Procedures. There were no special 
pretest procedures for this test, since demon- 
Strations preceded each item. For each item, 
the Subject was required to indicate the box into 
which each button presented should be placed, 
in imitation of the examiner’s demonstration. 
If necessary, the examiner picked up the but- 


ons for the subject, and торр і 
ti f Я а ed them into 
the boxes at his indication. 


The test consisted of six items, whch are 
shown on the scoring sheet (Table Al). Each 

L" on the Scoring blank represents one but- 
ton to be dropped into the box at the subject's 
left. Each “В” stands for one button to be put 
into the box at his right. The first three items 
required the correct placement of single but- 
tons, one to be dropped into the box at the 
left, the next into the box at the right, and the 
third into the box at the left. The fourth item 
consisted of two buttons, the fifth of three, and 
the sixth of four. The buttons were dropped 
into the boxes in alternating order, always be- 
ginning at the left. i 

Boxes were placed about 2 inches from the 
edge of the table. Buttons were presented ap- 
proximately an inch from the edge of the 
table, and 3 inches apart. Before each item 
was presented, the examiner demonstrated the 


^! Photographs were taken at the Shield of 
David Institute for Retarded Children, 


AND PROCEDURES FOR ADMINISTERING TIE TrsTs^* 


correct response, dropping the buttons into the 
boxes from a height of approximately a foot 
to ensure a sharp click. As the buttons struck 
the box, the examiner said, "Ping!" in a sing- 
ing tone, drawing out the final “ng.” After 
dropping each button, the examiner tapped the 
box into which it had been dropped, and said, 
"in here." Buttons used in the demonstrations 
were not removed until the subject's response 
was secured. p 

After each demonstration, the examiner 
placed the number of buttons required for the 
item before the subject, and said, “You do it, 
pointing from the subject to the buttons, ano 
then to the boxes, midway between them. A 
buttons were removed before the next demon- 
stration. The examiner said, "Ping" whet 
ever the subject gave a correct response. 
the response was wrong, the examiner cor- 
rected it, saying, "Ping!" as the buttons were 
placed correctly. 

The first three items were always given. If 
all three were failed, the test was discontinued, 
and training was started. Testing was also 
discontinued if the subject failed the second; 
third, and fourth items. If he failed the fourth 
and fifth items, the sixth was omitted. 


Training Procedures. Training began with 
the subject's first failure unless it had occurre 
on one of the first three items, in which case 
training began with the first item. Demonstra 
tions were made as before. The examiner oca 
presented the buttons to the subject, say”? 
"in here" for each button, pointing to the О 
rect box, and tapping it until the response was 


made. Again, “Ping |" accompanied all corre 
ог corrected responses, 


The subject was trained three times on each 


item, unless he achieved two consecutive SUC 
cesses. In that event, the third training WS 
omitted. 


Training was discontinued if there 
Was complete failure on the first three items. 
Training was also stopped if the second, third, 
and fourth items were failed. Training on the 
sixth item was omitted if no successes were 
achieved on the fourth and fifth. 

The Transfer Form. The original items were 
those used in the transfer situation. However; 
the demonstrations were omitted. In advanc 
of each presentation, the examiner pointed in 
the correct box or boxes, saying, "Ping! "s 
here,” or “Ping! in here” and “Ping! in 217 
and so оп. As before, correct or corrected 
sponses were accompanied by, “Ping!” 
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TABLE A1 
Score SHEET: Two-BurroNs TEST 


Administration A 
(DL OF бф OLR FEEL кик Е 


L-R-L-R 
L R L L-R т  L-R-L-R 
L 


Administration B 
WL @R (GL (LR (LRL 6) L-R-L-R bens 


Administration C 
(DL QR GL (LR GLRL © L-R-L-R Total 


ü)L (QR BML (MIR @LRL (6) L-R-L-R Total 


(1) L (2) R (3) Г. (4) L-R (5) L-R-L (6) L-R-L-R Total. 
Comments: 
Buss Test board, which joined at a 90-degree angle. 


Four black discs, 1 inch in thickness, and with 
diameters of 3, 6, 4, and 5 inches, respectively, 
were attached to the board in that order, with 
equal distances between them. Thus, beginning 
at the left, the smallest disc was first, the 
largest second, the next to the smallest third, 
and the next to the largest fourth. The sample 
discs on the board were wired so that, when a 
disc of corresponding size was placed over one 
of the samples, a continuous buzzing sound 
was produced. Two blunt nails projected from 
the board below each sample, to hold a disc ot 
comparable size in place. 


E 9 One set of separate black discs and one set 
Materials. А gray board, 23 inches long and of separate white discs, one disc in each set 
75 inches wide, attached at an angle of ap- corresponding in size to each of the sample 


proximately 45 degrees to a base and a back discs on the board. 


Pretest Procedures. The board was placed 
about 4 inches from the edge of the table. The 
examiner held the smallest black disc at the 
center of the board, about 1 inch from the edge 
of the table, at an angle corresponding to the 
tilt of the board. After holding the disc in this 
position for several seconds, the examiner 
placed it over the corresponding sample on the 
board, saying, ^This goes here." Aíter allow- 
ing the resulting buzz to continue for a few 
seconds, the examiner returned the disc to the 
center of the board, holding it as before. Point- 
ing from the subject to the disc, and then to 
the middle of the board, the examiner said 
"You do И.” The examiner placed the disc at 
the subject's indication, if necessary. One 
demonstration and one unscored response were 
allowed for each of the black discs. 
were corrected. 


Testing Procedures. For each item, the ex- 
aminer held a black disc in the position de- 


Scribed above, asking each time, 
this go?" 


АП errors 


i "Where does 
and pointing from the disc to the 
board. The subject was required to indicate a 
choice of one of the 
placing the disc for hi 
were corrected bef 
item. 


samples, the examiner 
m, if necessary. Errors 
ore continuing to the next 


Ó There were six items, each involving match- 
ing one black disc to the corresponding sample. 
The Scoring sheet (Table A2) shows the order 
in which the discs were presented. The number 
їп parenthesis on the Scoring sheet refers to 
the number of the item. The number which 
follows shows the position on the board of the 
sample Corresponding in size to the disc pre- 
sented, i.e., “2” represents the largest sample, 
which Was second on the board. All of the 
items were given, unless the subject failed the 
first four, in which case the last two were 
omitted. 
Training Procedures. Training always be- 
gan with the first item. The examiner began 
by demonstrating the response, saying, “This 
goes here,” producing the buzz, and then re- 
moving the disc and presenting it to the sub- 
ject, as before. The examiner pointed to the 
correct sample, and continued to tap it, repeat- 
ing, “Here,” until the response was made. If 
errors still occurred, they were corrected by the 
examiner, 
A special order of the items was used in 
training on this test, in order to minimize the 
tendency to perseveration. Instead of the usual 
procedure of completing training on each item 
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before beginning training on the next, the sub- 
ject was trained once on all items, in the order 
in which they were presented in the test, be- 
ginning with Item 1. The entire series was 
repeated three times, unless no errors were 
made in the course of the first two series. 

The Transfer Form. Using the same items, 
and following the original procedures, the 
white discs were presented to the subject, to 
be matched to the black samples of correspond- 
ing sizes on the board. 


Music Box Test 


Materials. Five round music boxes, 2 inches 
high and 3 inches in diameter. A handle, which 
could be turned in either direction, project 
2 inches from the center of the box. The sh 
of the handle extended vertically from the di 
for a distance of 1 inch, and was then bent 2 
a right angle, continuing horizontally for а" 
other inch, and ending in an upright woode? 
cylinder, 0.5 inch in diameter and 1 inch 1018 
One box was painted black, one white, 016 
gray, one black and gray, and one gray an^ 


white. Only the gray box functioned, playing 
а repeated series of four tones. 


Five rectan 
high, 4.5 inch 
other respects 


gular music boxes, 2.5 inches 
es long, and 2.75 inches wide, i! 
like the round boxes. 


Pretest Procedures. The round gray box 
was placed before the subject, about an inch 
from the edge of the table, with the handle 
toward him. The examiner turned the handle 
for Some 3 seconds, saying, “music, music 
music.” The examiner then said to the subject 
You make music,’ pointing from the subje“ 
to the gray music box. If necessary, the €*, 
aminer placed the child's hand over or again? 
the handle and turned it for him, repeat?’ 
"music, music, music." 
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TABLE A2 


SCORE SHEET: 


Administration А 


(01 Оз @2 Фа G1 (94 
Training 
132814—132414 — 
Administration B 

(01 сз Q2 (043 61 (94 
Administration C 

(1) 1 (2) 3 (3) 2 (4) 4 (5) 1 (6) 4 


23 
Buzz ТЕЗТ 
Total. 
132414 
Total 
Total 


Administration D 


@ 1 (2) 3 (3) 2 (4) 4 (5) 1 (6) 4 
Administration E 

ai Qs 92 @4 G1 ©4 
Comments: 


The gray box was left on the table, but was 
pushed aside, some 6 inches to the subject’s 
left. The round black and gray box was pre- 
sented in center position, its black side toward 
the gray box. The examiner turned the handle 
about 3 seconds, shaking his head and saying, 
no music, no, no music.” The subject was 
then asked to turn the handle, as before, the 
examiner repeating, "no music" as he re- 
sponded. 

The black and gray box was pushed to the 
subject's right, and the round gray and white 
box was presented, with the white side toward 
the gray box. The demonstration and response 
were repeated. The box was then pushed to 
the subject's left, and the gray box was pre- 
sented again. The examiner played it, again 
for about 3 seconds, saying Here's the music," 
and asking the child to play it, as before. The 


boxes were removed before the test began. 


Testing Procedures. Boxes were presented 
about 4 inches apart, the handles turned toward 
the subject. At each presentation, the examiner 
said, “Уон make music," pointing from the 
child to the music boxes. The subject was 
required to indicate his choice, the examiner 
turning the handle of the box he selected, if 
necessary. If the subject's response was wrong, 
the examiner pointed to the gray box, turned 
the handle, and said, "Here's the music.” 

The shifts in the positions of the boxes were 
made with the subject watching. However, to 
prevent uninterrupted visual fixation on the 
gray box, the subject's attention was briefly 
diverted during each shift. To keep the inter- 
ruption as standard as possible, the examiner 
said, “Look at me, (subject’s name)," and 
then redirected the child’s attention to the 
table. This required 2 or 3 seconds. 


The scoring sheet (Table A3) shows the 
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TABLE A3 
SCORE SHEET: Music Box TEST 
Administration A 


L ; 5 IG /-B-B/G 
GB (2) W-G (3) G-W-B (4) B-W-G-W/G (5) W/G-G-W-B-B/ 
e n ыы (6 W-B-B/G-G-W/G E 


Training 


св WO EWS PW GAUG W/ccw-p-B/G УСМ 
GB WG GWB B-W-G-W/G W/G-G-W--B/G W-B-B/G-G-W/G 
GB WG G-W-B B-W-G-W/G W/G-G-W-B-B/G W-B-B/G-G 


Administration B 


(0 С-В (2) М-С (3) G-W-B (4) B-W-G-W/G (5) W/G-C-W-B-B/G sei 
(6) W-B-B/G-G-W/G on 


Administration С 


G) GB (2) М-С (3) G-W-B (4) B-W-G-W/G (5) W/G-G-W-B-B/G : 
(6) W-B-B/G-G-W/G Til 


Administration D 


CB Q) W-G (3 G-W-B (4) B-W-G-W/G (5) W/G-G-W-B-B/G 
(6) W-B-B/G-G-W/G Total 


Administration E, 


GB Q)Ww-G (3) G-W-B (ау B-W-G-W/G (5) W/G-G-W-B-B/G 
(6) W-B-B/G-G-W/G 


Comments: 


number of boxes used for each of the six 
items, and the order in which they were pre- 
sented. The letter "G" on the sheet represents 
the gray box, "B" the black, “W” the white, 
“B/G” the black and gray, and “W/G” the 
white and gray. Two boxes were used for 
each of the first two items. Three were used 
for the third, and four for the fourth. Five 
boxes were used for the fifth and sixth items. 
The two-toned boxes were not introduced into Training Procedures. It the subject's first 
the test until the fourth item. Опе of them was failure occurred on one of the first two items 
used in the fourth item, and both in the fifth training began with the first. Otherwise: % 
and sixth. In the fifth item only one of the began at his first failure. The examiner Pt 
two-toned boxes was placed next to the gray sented the Нет, turned the handle of the 878 


box. In the sixth item, the gray 
sented with a two-toned box on each side- 
The two-toned boxes were always presente“ 
with the gray side away from the gray Бох: 
The first three items were always given. The 
test was discontinued if these three items, or 
the second, third, and fourth items were failed- 
The sixth item was not given if the subject 
failed the fourth and fifth. 


box was pre- 
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box, and said, "music, music, music." Then, 
pointing from the subject to the gray box, the 
examiner said, “Уон make music," tapping the 
gray box, and repeating, "Here's the music," 
until the response was secured. If necessary, 
the examiner placed the handle of the gray 
box into or against the subject's hand, turning 
the handle for him, and saying, “Music, here's 
the music." No interruptions were made in the 
shifts of materials during training. 

The subject was trained three times on each 
item, or through two consecutive successes. 
If all test items were failed originally, train- 
ing was always given оп the first three 
items. It was discontinued if these were failed 
completely, or if complete failure occurred in 
training on the second, third, and fourth items. 
Training was omitted for the sixth item, if no 
successes were achieved on the fourth and 
fifth. * 

The Transfer Form. The rectangular music 
boxes were used in place of the round ones. 
The original testing procedures were followed, 
and the original test items were used. The 
round set was pushed to the back of the testing 
table, but was not removed, in an attempt to 
emphasize the fact that different boxes were 
being presented. 


Light Test 


Materials. Two gray boxes, each 14.5 inches 
long, 11 inches wide, and 6.5 inches high. 
White knobs were used on one of the boxes, 
and corresponding black knobs on the other. 
Above the knobs, on each side of the box, was 
a plastic covered "window," 6 inches long and 
] inch wide. Behind the windows were a series 
of flashlight bulbs, which lit when the rectangu- 
lar knob was pressed. A single rectangle ap- 
peared on each side. А rectangle and a sphere 
were used on Side 1. Side 2 contained a dia- 
mond and a rectangle. Side 3 included a 


rectangle, а diamond, and a triangle. А dia- 
mond, a rectangle, and a triangle, and a sphere 
were used on Side 4. The spheres were 2 
inches in diameter. The triangles were 2.5 
inches on each side. The diamonds were 2.75 
inches long, and 1.5 inches wide. The rec- 
tangles were 2.5 inches high and 1 inch wide. 

Pretest Procedures. The box with white knobs 
was placed about 2 inches from the edge of the 
table, Side 1 facing the subject. The examiner 
pressed the rectangular knob, pointed from the 
knob to the lighted window, and said, "Light, 
see the light.” Then the examiner pressed the 
sphere, pointed from the knob to the unlit 
window, shook his head, and said, "No light, 
see, no light." Next, the examiner pressed 
the rectangle again, pointing as before, and 
saying, “Here's the light.” Turning to the 
subject, the examiner said, “You make light," 
pointing to the subject, and then to the box. 
If necessary, the examiner pressed the knob 
at the subject's indication, and corrected the 
response in case of error. Аз the rectangle 
was pressed, the examiner pointed again to the 
window, repeating, "See the light.” The ex- 
aminer then turned the box so that Side 2 
faced the subject, and the entire demonstration 
was repeated. 

Testing Procedures. The test contained eight 
items, each consisting of the presentation of 
one side of the box. The order in which the 
sides were presented is shown on the scoring 
sheet (Table A4). As each side was presented, 
the examiner said, “You make light," pointing 
from the subject to the box. If necessary, the 
examiner pressed the knob which the subject 
indicated as his choice, and corrected any 
errors made, All items were given unless the 
subject failed the first four. 

Training Procedures. Training began with 
Side 1. The examiner pressed the rectangular 
knob, saying, "Here's the light," pointing 
from the rectangle to the lighted window. The 
subject was then asked “to make light,” the 
examiner pointing to the rectangular knob 
and tapping it until the subject’s response was 
made. The examiner repeated “Here’s the 
light” as the rectangular knob was pressed, 
and corrected errors if necessary. 

The subject was trained three times, or 
through two consecutive successes, on each 
side of the box. Training was given on each 
of the four sides, unless the subject achieved 
no successes in training on the first three sides. 


The Transfer Form. The original proce- 
dures were followed, using the box with black 
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TABLE A4 
Score SHEET: LIGHT Test 
Administration A 
(Oe Q9 "s Ша (51 (602 (03 (8) 4 Тов 


Training 


Side 1 Side 2 Side 3 Side 4 
Side 1 Side 2 Side 3 Side 4 
Side 1 Side 2 Side 3 Side 4 


Administration B 


(01 02 (3 (204 (51 (62 буз (8) 4 Total 


Administration C 


01 @2 Q3 Фа (91 (92 (3 (8) 4 Total__——— 


Administration D 


Чї @2 з ша (1 (92 (пуз (894 Total__— 


E 07 Os we (oi өз фз (8) 4 Total. E 


Comments: 


knobs. The first box 
of the testing table, 
sight. 


was pushed to the back 


Materials. Th i s 35 
but was not removed from i cican plastic boxes, 


inches high, 3.5 inches along the bottom, and 
4 inches along the top. The boxes were place 
3 inches apart, and attached on parallel sides 
Three-Buttons Test to two wooden runners, 17 inches long, 1 inch 
wide, and 1 inch high. і 

Nine round buttons, 1.5 inches in diameter; 
3 black, 3 white, and 3 gray. 

Pretest Procedures. The boxes were placed 
about 2 inches from the edge of the table, The 
examiner dropped a sample white button int? 
the box at the subject’s left, saying, "Ping! 
as the button struck the bottom of the bo*: 
Leaving the sample in place, the examine 
placed another white button before the subje 
about 1 inch from the edge of the table к. 
approximately at the center point of the п! 
dle box, and said, “You do it,” pointing ff? 
the subject to the button, and then to the 
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containing the sample. If necessary, the ex- 
aminer placed the button at the subject's 
direction, and corrected or correctly completed 
the response. These procedures were repeated 
with black and gray buttons, using the middle 
and the right hand boxes, respectively. The 
examiner added “Ping!” as each button was 
correctly placed. The sample buttons remained 
in the boxes throughout testing and training. 


Testing Procedures. Buttons were presented 
about an inch from the edge of the table. 
Single buttons were presented approximately 
at the center of the boxes. For the two-button 
items, one button was placed at the mid-point 
of the space between the left and center boxes, 
and the other at the corresponding point be- 
tween the right and center boxes. For the 
final item, the middle button was placed at the 


center of the boxes, and the others at the mid- 
points of the spaces between the center box and 
those to the left and right, respectively. 

The buttons used for each item were re- 
moved after its completion. Buttons used in 
each of the six items are shown on the scoring 
sheet (Table A5). “W” represents a white 
button, "B" a black one, and “С” a gray one. 
The first three items consisted of placing a 
single white, black, and gray button into the 
box containing the corresponding sample, the 
examiner dropping the buttons into the boxes 
at the subject's direction, if necessary. The 
fourth item consisted of a white and a gray 
button, and the fifth of a gray button and a 
white one, presented in that order. A black, 
a gray, and a white button were used for the 
sixth item. A correct response for the latter 


TABLE A5 


Score SHEET: THREE-Butrons TEST 


Administration A 


0) W (2) B (3) G (4) W-G (5) G-W (6) B-G-W Total 
Training 

wW B с WG G-W B-G-W 

ү В с мс G-W B-G-W 

ү В G W-G GW B-G-W 

Administration B 

(1) W (2) B (3G (4) W-G (5) G-W (6) B-G-W Тоа! = 
Administration C 

абс Qw (в ()GB (5) B-G (6) W-B-G Total 


Administration D 


(0W ов (с @ WG (9GW (6 B-G-W е iur. d 
Administration E 
Dow qub Fe Фа eee Total 


Comments: 
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items required the correct placement of all of 
the buttons presented. 

As each item was presented, the examiner 
asked, “Where does this go?" or, “Where do 
these go?" pointing from the buttons to the 
boxes. The examiner accompanied each cor- 
rect placement with a "Ping!" Errors were 
corrected, or correctly completed by the ex- 
aminer. The first three items were always pre- 
sented. Testing was discontinued if the sub- 
ject failed the first three items, or the second, 
third, and fourth. The last item was omitted 
if the fourth and fifth were failed. 

Training Procedures. Training was given 
three times on each item, or through two con- 
secutive successes. Training always began 
with the first item. However, if the subject 
had passed the earlier items on the test, he was 
trained only once on those items, regular train- 
ing beginning at the point of his first failure. 

Before each item, the examiner demonstrated 


the correct placement for each button, pre 
sented the item, and pointed to the correct box 
until the subject had responded, saying, “т 
here,” while tapping the edge of the box. All 
errors were corrected, and a “Ping!” accom- 
panied a correct response made by the subject, 
or by the examiner at the subject’s indication. 
Training was discontinued if there was сот- 
plete failure on the first three items, or on the 
second, third, and fourth. Training was 
omitted on the sixth item if the subject failed 
the fourth and fifth. 

The Transfer Form. The samples for the 
original form of the test were removed, and 
those used for the transfer situation were 
placed into the boxes by the examiner, follow- 
ing the procedures used in placing the origina 
samples. The positions of the samples, and the 
buttons used for each item, are shown Jn 
Table A5 (see Administration C). The ors! 
nal testing procedures were followed. 
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APPENDIX B 


Tue EXPERIMENTAL Test BATTERY 


This section describes the preliminary bat- 
tery used in the pilot studies. The various 
revisions of the tests which were ultimately 
included in the final battery are described first. 
The rejected tests are described thereafter. 


Revisions of Tests Used in the Final Battery 


The Two-Buttons Test underwent eight re- 
visions in the pilot studies. The test originally 
contained five items. The first two consisted 
of single buttons, one to be placed in the box 
to the subject’s left, and the other in the box 
to his right. Two, three, and four buttons 
were then presented, to be dropped into alter- 
nate boxes. The resulting scores were too low. 
The earlier pilot studies therefore concentrated 
on adding various easy items. 


The addition of two single-button items was 
attempted first. This, however, produced a 
marked increase in perseveration. When the 
new items required reversing the order of 
placement used for the original items, the sub- 
ject found the shift almost impossible to make, 
and training merely tended to increase this 
difficulty rather than to overcome it. On the 
other hand, when the added items duplicated 
the original order, the subjects continued using 
that order regardless of the number of buttons 
presented. Perseveration decreased to a satis- 
factory extent when the number of single- 
button items was reduced to three, and only 
an alternating order was required for all 
placements. 

Results of various changes in the multiple- 
button items were fairly consistent. Items con- 
taining odd numbers of buttons were found to 
be more difficult than even-numbered items, to 
the extent that training for a three-button item 
tended to be less successful than training for 
ап eight-button item. However, once a subject 
had grasped the concept of placing buttons in 
alternating order, increasing the buttons in 
even numbers presented little additional diffi- 
culty. Items consisting of six and eight but- 
tons were therefore dropped as superfluous. 

Of the various odd-numbered items at- 
tempted, only one three-button item was re- 
tained, since inclusion of two items using odd 
numbers of buttons made the test too difficult. 
The three-button item was tried out in various 

ositions, and was found to be most useful in 


next-to-last position. 


Various transfer forms for this test were also 
studied. In some, the examiner used the origi- 
nal white buttons for the demonstrations, pre- 
senting black and/or gray buttons to the sub- 
ject. These were found to be too easy, since 
higher scores than on the retest administration 
were obtained. On the other hand, when black 
and white buttons were both presented to the 
subject, the resulting scores were too low. 
Omission of pretest demonstrations and in- 
structions also made the form too difficult, the 
subjects apparently failing to understand what 
was required of them. However, omitting the 
demonstrations, but pointing to the correct box 
or boxes in advance of the presentations, 
seemed to provide sufficient cues. 

Of the various sounds tried out as the verbal 
accompaniment of the demonstrations, “Ping!” 
was found to be the most interesting to the 
children, especially when pitched high and 
given a drawn-out “ng” at the end. Virtually 
all of the subjects smiled or laughed at this 
sound. Because of its evident appeal, the sound 
was reserved as an accompaniment for correct 
responses only, to increase motivation and re- 
inforce learning. 

The Buss Test was revised six times in the 
preliminary experimentation. As it was origi- 
nally constructed, the sample discs were аг- 
ranged on the board in order of size. A round 
gray peg projected at right angles from the 
center of each sample, and extending 2 
inches beyond it. Each of the discs presented 
to the subject had a round hole, 1.5 inches in 
diameter, at the center. The discs slipped over 
the pegs projecting from the samples, the buzz 
being produced by pressing a disc against its 
corresponding sample on the board. The ma- 
jority of the subjects were unable to make the 
necessary size discriminations for this version 
of the test. It appeared to be especially diffi- 
cult for the brain-injured children. 

The pegs were removed in the next pilot 
study, and discs without holes were used. To 
produce the buzz, the presented disc was laid 
over the corresponding sample, supported by 
blunt nails driven into the board just below 
the samples. This change made the test easier, 
and did not appear to penalize the brain-injured 
subjects. However, perseveration, especially 
after training, was so great that some of the 
subjects did not shift from the first response 
for the remainder of the test. It is interesting 
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to note that perseveration occurred to a sig- 
nificant extent only on this test and the pre- 
ceding one. It was virtually absent on all other 
tests throughout their revisions. 

In an attempt to overcome perseveration, 
the differences between the sizes of the discs 
were increasd. This was unsuccessful. In the 
next attempt, the arrangement of the samples 
was changed, to avoid a regular increase in 
size. The smallest disc was placed first and 
followed by the largest. The next to the small- 
est came next, with the next to the largest, 
last. Perseveration was almost entirely over- 
come. However, the test was now too easy. 

To increase the difficulty, presenting each 
disc twice was attempted. Perseveration in- 
creased sharply. Presenting the discs in the 
order in which they were used in the test, and 
then repeating the series again in that order, 
overcame the tendency to perseveration, but 
made the test too long to hold interest. Two 
items were therefore dropped. One complete 
series was given, plus two additional items con- 
sisting of one placement at the extreme left 
and one at the extreme right. This did not give 
rise to perseveration, and shortened the test 
sufficiently. 

The Music Box Test was changed six times. 
In its original form, it consisted of two gray 
boxes and two white boxes. The one func- 
tioning box was black. Since this form was 
found to be too easy for purposes of discrimi- 
nation, two additional boxes were used, one 
gray and one white. However, this increase 
in the length of the test tended to make the 
subjects restless. The number of boxes was 

i. again to five, and one of the gray 

box as darkened considerably. No marked 

Increase in difficulty resulted. However, when 

the boxes were repainted so that the darker 

gray box played, the subjects found it too 
difficult to distinguish between the two shades 
of gray. 

A two-toned box, half white and half black, 
was introduced in the next attempt, the single 
gray box playing. Again the test was found 
to be too easy. Two two-toned boxes, one black 
and gray and one gray and white were tried 
next, presented with their gray sides facing 
the gray box. This tended to confuse the sub- 
jects, especially the brain-injured children. In 
the final form, the two-toned boxes were al- 
ways presented with their gray sides away 
from the gray box, which considerably les- 
sened the confusion, and also made the test 
more suitable for the brain-injured subjects. 
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Since the original rectangular boxes were 
satisfactory for the transfer form, no changes 
were made in their size and shape. However, 
they were painted to correspond with the 
round boxes in each revision. . 

The Light Test underwent seven revisions. 
Only three shapes were used in its original 
construction : spheres, triangles, and rectangles, 
the latter being the correct response. Three 
knobs were the maximum used on any one 
side of the box. Although this form was too 
easy for the group as а whole, the first two 
sides were not changed, the simpler items be- 
ing necessary for discrimination among the 
more retarded subjects. The revisions there- 
fore concentrated on increasing the difficulty 
of the remaining items. Д 

А larger box was constructed. The fourth 
side now containing four knobs, mae 
second sphere. Since this did not increase b 
difficulty sufficiently, all spheres were remove 
from the fourth side, and a diamond was intro- 
duced. This was still too easy. One sphere 
was therefore added, the rectangle being 11 
serted between the sphere and the diamond. 
Since this also was unsuccessful, the rectang!€ 
was placed between the diamond and the tf 
angle, a change which greatly increased tht 
difficulty of the item. The original sphere 01 
the third side of the box was also replaced bY 
a diamond, which increased the difüculty © 
this item as well, 

The Light Test was so interesting to ue 
subjects that the length of the test could t 
increased without much risk of loss of inte! 65" 
Two methods were tried for adding tO 
number of the items. In the first method, €4€ 
side of the box was presented twice, before 
the next side was turned to the subject. This 
produced highly uneven scores, the second 
item on each side being considerably easie! 
PA Ws iret In the final revision, the sides 
Were presented in the order of difficulty, the 
entire procedure being repeated twice. 

‚ The Three-Button Test was revised fout 
times in the course of the pilot studies, In its 
original form, in which only single buttons 
were used, it was found to be too easy for 
purposes of discrimination, Introducing two* 
button items did not increase the difficult 
sufficiently. On the other hand, proceedin£ 
directly from single buttons to three-butto™ 
items made the test too difficult. In the fin? 
form, three single-button presentations we" 
used, followed by two two-button items, W' 
one three-button item as the final presentati? 


| 
| 
| 
| 
| 
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Rejected Tests 


The Sounds Test, a test of shape discrimi- 
nation, consisted of a gray board like that 
used in the Buzz Test, but 2 inches longer. 
Projecting from the board were four knobs: 
an octagon, a triangle, a gear, and a star. 
Pressing the octagon caused chimes to sound, 
the triangle rang a bell, the gear produced a 
buzz. Pressing the star caused no sound. The 
subject was shown, in pretest demonstrations, 
the results of pressing the various knobs. He 
was then asked to reproduce the sound, being 
instructed. "to make ting-a-ling," "to make 
buzz," and so on. For the star, he was asked 
"to make sh-sh," the examiner putting a finger 
to his lips to indicate silence. Various patterns 
oi sounds were used during the pilot studies. 

A number of special difficulties arose. One 
problem was the unwillingness of the subjects 
to shift from one knob to another. This was 
apparently not due to а perseverative tendency, 
since the subjects seemed to be able to shiít 
when they elected to do so. However, they 
liked the sounds, and, having produced one, 
they tended to repeat it with evident enjoy- 
ment, regardless of instructions. 

In an effort to force the shift, cut-off but- 
tons were installed, so that the examiner could 
shut off the sound after the response was 
secured. This tended both to confuse and to 
anger the subjects, who often pressed harder 
and harder on the knob, apparently to *make 
it work" In the next attempt, the examiner 
demonstrated the next response immediately 
after completion of the previous one, thus di- 
recting the child's attention to the new sound. 
This was successful in facilitating shifts, but 
reduced the test to a simple imitation problem. 

All attempts to secure responses based on 
sound patterns were apparently beyond the 
abilities of the subjects, even when only two 
sounds were used at a time. The test was 
finally abandoned as too difficult for purposes 
of discrimination for this population. It was 
tried out experimentally with a group of mod- 
erately retarded subjects, for whom it appeared 
to be highly suitable. The subjects enjoyed 
the test, and were able to handle simple sound 
patterns. A transfer form based on reversals 
of the original patterns also appeared to be 
feasible. Since the test can easily be used for 
presenting patterns ranging from the very sim- 
ple to the highly complex, it might serve in an 
upward extension of this method of testing. 


The Rattle Test was a test of imitation, in 
which subject and examiner used comparable 


sets of material. The subject was required to 
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select the correct material, and use it in imita- 
tion of the examiner's demonstration. The 
materials included a rattle, a small pillow, a 
hairbrush, and a cup. Aíter each demonstra- 
tion, the subject was asked to “make rattle, 
rattle, гае,” “to go to sleep," and so оп. А 
correct response required that the subject indi- 
cate the material to be used, and the part of 
the body with which its use was associated, i.e., 
the head for the pillow, the mouth for the cup, 
and so on. 


This test was so appealing to the subjects 
that it was retained in various forms until the 
last pilot study. Unfortunately, the revisions 
did not succeed in making it sufficiently useful 
for purposes of discrimination. When the ma- 
terials were presented singly or in groups of 
twos or threes, the test was too easy. When 
more complicated patterns were attempted, the 
test became too difficult. Similar difficulties 
were found in the various transfer forms. Re- 
versals of the order of the materials was too 
easy, while the use of the verbal instructions 
without the demonstrations was too difficult. 

Though the seven revisions failed to result 
in an acceptable form of the test, it is still 
thought to be potentially useful because of its 
exceptional appeal to the children. It is also 
thought to have possibilities for other groups 
of retarded subjects. Limited experimentation 
suggested that presentation of the materials 
singly and in small groups would be suitable 
for younger severe retardates, while more 
complicated patterns seemed to be appropriate 
for the moderately retarded. The test was 
highly successful in holding the interest of 
both groups. 

The Bang Test was also a test of imitation. 
The examiner struck the table an increasing 
number of times, saying, "Bang!" at each 
strike. The subject was then asked “to make 
bang!" or "to make bang! bang!" and so on. 
The same procedures were repeated for the 
transfer form, using the vertical surface of a 
wall instead of the horizontal tabletop. Various 
items in different orders were tried, the num- 
ber of strikes ranging from one to seven. 

The test presented several problems. A ma- 
jor difficulty was a marked tendency on the 
part of the subject to respond with random 
banging, a tendency which training did not 
often overcome. However, the most serious 
problem was that a number of the subjects 
disliked the test. Although they appeared to 
enjoy the demonstrations, they did not want 
to imitate them. In the case of subjects with 
whom the training was successful, another 
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type oi difficulty arose: some continued to 
strike or bang the material of subsequent tests, 
disrupting the testing situation. 

The Drum Test and the Tambourine Test 
were attempts to retain the method of the pre- 
ceding test. The items in the Bang Test were 
repeated, first by beating against a drum with 
the hand, and later by striking a tambourine. 
In both cases, the subjects did enjoy the test. 
However, their tendency to respond randomly 
was not lessened, and attempts at training 
tended to increase it still further. 

The Bell Test was the most successful of the 
attempts to utilize the principles of the Bang 
Test. A domed white bell was used, in which 
a clapper hit the metal side of the bell when 
the projecting top was struck. The bell was 
very appealing to the subjects, though several 
revisions were necessary in its construction 
before it was suitable. A flat white disc raised 
well above the bell's dome was used in place 
of the original top, which was too small to 
attract the subjects' attention. More successful 
was a black dise, set high above the dome. 

Тре use of the bell decreased the tendency 
to respond randomly, although it did not over- 
come it entirely. However, now that a greater 

number of correct responses were secured, 
another problem came into evidence. Subjects 
who had learned rudimentary counting per- 
formed better than subjects who had not. The 
test was therefore abandoned, since successful 


performance $0 evidently depended on differ- 
ential past experience, 


The Egg Test was a test of size discrimina- 
tion. A lure was placed in the largest of à 
series of wooden eggs of various sizes. This 
egg was then presented to the subject d 
different position for each item, the eggs i) 
ing used in increasing numbers. Each be 
was presented resting on its base, tapered en 

ward. | 
kc the first form of the test was found to 
be too easy, decreasing the differences between 
the sizes of the eggs was attempted. T 
change made the test too contusing, peu. 
for the brain-injured subjects. Several attempt 
were also made to present the egg in a Иш 
zontal position. A gray board with oval La 
dentations was used to prevent the eggs fr on 
roling out of position. These attempts ie 
not successíul. When the eggs were presens 
so that the line from base to apex was paralle 
to the edge of the table, the test was too сне 
On е other hand, the test became too HN 
when either the tapered ends of the eggs А 
their bases were presented toward the subject 
Several transfer forms were attempted, пы 
black eggs of the same sizes as the white. des 
and various other shapes of corresponding 1€“ 
tive sizes. | " 

The test was finally abandoned as it beca! И 
increasingly evident that the Buzz Test w * 
more suitable size discrimination test tap 
population, and was also easier to administer. 
However, the Egg Test was attempted n 
first form with a group of younger seve! Ti 
retarded subjects, for whom the Buzz Test be 
too difficult. Results suggested that the, E 
Test may be suitable for tls younger subje 
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| Em investigations of sleep deprivation 
arose mainly from an interest in the 
influence of fatigue on performance. Ex- 
cept for a few studies such as the remark- 
able account of a 10-day vigil of a man who 
believed sleep to be unnecessary (Katz & 
Landis, 1935) this early work was rather 
inconclusive with respect to the effects of 
` sleep loss on performance. А review of the 
carly literature may be obtained from Ed- 
wards (1941), Katz and Landis (1935), 
Kleitman (1939), and Laslett (1928). 

Not until recently (Kornetsky, Mirsky, 
Kessler, & Dorff, 1959; Wilkinson, 1958, 
1959; Williams, Lubin, & Goodnow, 1959) 
have really adequate methodology and sta- 
tistical treatment of results been applied to 
the problem of sleep loss and performance in 
human subjects (5s). Wilkinson's excellent 
studies are particularly noteworthy for the 
large number of Ss employed and for the 
variety of the situations studied. One of 
his most significant findings appears to be 
the hitherto unrecognized importance of 
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feeding back information to the sleep- 
deprived S about the scores that he is 
making. Without such feedback in a serial 
reaction test, performance was considerably 
impaired under the condition of sleep de- 
privation, whereas with feedback this im- 
pairment was practically eliminated. This 
finding is well established in two separate 
studies by Wilkinson (1958, 1959), and in 
the latter study, Wilkinson (1959) соп- 
cludes that although inadequate incentive 
tends to yield poor performance even in 
normally rested persons, the need for ade- 
quate (and frequent) incentive becomes 
disproportionately great in those who have 
been sleep deprived. 

Animal Studies. Employing rats as Ss, 
the effects of enforced wakefulness on 
learning and performance in a water maze 
were investigated by Bunch, Frerichs, and 
Licklider (1938), and by Licklider and 
Bunch (1946). The 1938 experiments with 
periods of enforced wakefulness of 8, 16, 
24, and 48 hours, produced rather surpris- 
ing results, showing that in the case of all 
learning criteria (trials, time, and errors) 
the wakeful rats were superior to the con- 
trols. In the 1946 study, the animals’ sleep 
was restricted to 4 hours per day, one group 
of wakeful rats undergoing 123 days of 
experimentation and a second group of 
wakeful rats undergoing 73 days of ex- 
perimentation. Sleep deprivation again ap- 
peared to have facilitative effects although 
performance rather than learning appeared 
to be affected, and the authors stated (p. 
346) : “the better performance of the wake- 
ful rats is largely to be accounted for in 
terms of their more aggressive behavior.” 
In their conclusions the authors stressed 
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that the restriction of the animals' sleep 
made them “highly irritable and highly 
reactive." 

Nauta (1946), who produced sleepless- 
ness in rats by making lesions in the preoptic 
nucleus, reported that after a period aver- 
aging 3 days "the exhausted animals fell 
into a state of coma which soon ended in 
death" (р. 303). It appears that sleepless- 
ness produced by lesions may accelerate 
collapse, because Licklider and Bunch 
(1946) reported that rats in good health 
collapsed in 4 to 18 days without sleep. 
Before Nauta’s rats collapsed, however, 
they too showed irritability, restlessness, 
and vigorous reactions to minor stimuli. 
Nauta likened some of this behavior to 
sham rage. 

Sensory Thresholds. Several studies have 
been concerned with determinations of sen- 
sory thresholds under conditions of sleep 
deprivation. The length of the deprivation 
period varied from study to study, and in 
certain studies thresholds were determined 
after different periods without sleep. Cooper- 
man, Mullin, and Kleitman (1934) in а 
study of cutaneous sensibility carried out 
with von Frey hairs, found that reports of 
pain occurred with weaker stimulation dur- 
ing a vigil of 60 hours. From their article 
it appears that this increased sensitivity to 
pain was not observed during the determina- 

tions made after 24 hours of sleeplessness ; 
and this finding is in agreement with that 
of Schumacher, Goodell, Hardy, and Wolff 
(1940) that 24 hours' sleep deprivation pro- 
duced no change in the thermal pain thresh- 
old. Except for the increased sensitivity to 
pain, and decreased sensitivity to sour taste 
( Furchtgott & Willingham, 1956), changes 
in sensory thresholds have not been ob- 
served following sleep 1055. Negative find- 
ings were obtained with touch by Cooperman 
et al. (1934) after approximately 60 hours’ 
deprivation. with audiometric tests by Good- 
hill and Tyler (1947) after 50 and 100 
hours' deprivation, and with limens for salt 
and sweet taste by Furchtgott and Willing- 
ham (1956) after 24-, 48., and 72-hour 
periods of sleep deprivation. 

Electroencephalographic and Other Phys- 
iological Changes. EEG changes in the 


sleep-deprived S were, as far as we know, 
first studied by the group working with 
Kleitman. Among other things, they noted 
that when the S was engaged in a mental 
task requiring his concentration there ns 
“а great discharge of beta waves "ee i 
replaced the delta rhythm (Blake, Gerard, 
& Kleitman, 1939, p. 55). 

Tyler, Goodman, and Rothman (1947) 
investigated the effect of sleep deprivation 
on the percentage of time that. cach ped 
quency occupied in the EEG tracing. Thes 
workers were careful to exclude from their 
measurements any segments of records 
taken when the S was suspected of dozne 
The authors concluded that at 75 hours i. 
sleep deprivation there was a shift ш 
percentage of time occupied by the various 
frequencies such that less time was т 
by frequencies around 9-10 cps, and а 
time was occupied by the higher frequencies 
(11-17 eps and 18 eps and higher). Conr: 
recordings were taken 3 days prior to th 
experimental period, and were compare 
with recordings taken under three separa 
sleep deprivation conditions : 50, 75, and 
hours, with a different group of В he 
serving in each of the three conditions. 10 
authors concluded that the decrease in 27 ae 
срз activity and the corresponding incre? 
in the higher-frequency activity was asso E 
ated with the effort exerted by the 5s 
staying awake. This conclusion was P3 
on their observation that similar change? E 
the EEG were found to occur while the >> 
were engaged in problem solving. 

jerner's (1949) ERG 
concerned with a different aspect of the 
sleep-deprivation phenomena, Instead 0 
excluding EEG measurement during brie 
periods of sleep as Tyler et al. had done: 
Bjerner studied these inadvertent “саар.” 
His five Ss performed serial reactions with 
an apparatus specially designed for this 
study. The major part of the data analysi§ 
consisted in locating instances of gross laps? 
in performance (delayed actions) on th? 
graphic tracings of performance, and фе? 
marking off the corresponding parts of t 
EEG and heart rate tracings for measur 
ment. In this EEG analysis Bjerner core 
cerned himself only with the number 
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study was main) 
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alpha waves counted (a) during the period 
of delayed action, (b) for the 3 seconds 
before, and (c) the 3 seconds after the 
delayed action. Using the 3-second periods 
before and after delayed actions as basis 
Гог comparison, Bjerner found a signifi- 
cant drop in the number of alpha waves 
during delayed actions. 

It is important to recognize that Bjerner’s 
procedure was radically different from that 
of Tyler and his co-workers whose measure- 
ments were confined to parts of the vigil 
when the experimenters (Es) were certain 
that the S was not dozing. Bjerner’s obser- 
vations, on the other hand, were confined 
to periods when the S had dozed off during 
the performance of the serial reaction test. 
Bjerner’s protocols actually noted the occur- 
rence of snoring during some of the periods 
of delayed actions when the tracings to be 
measured were being taken. Accompanying 
the significant fall in number of alpha waves 
from the EEG during delayed actions was 
a significant transient fall in heart rate. 

In a recent investigation Armington and 
Mitnick (1959) reported a progressive re- 
duction in the amount of alpha (8-12 cps) 
activity with sleep deprivation. In the in- 
terpretation of these results, Armington and 
Mitnick referred to the work of Stennett 
(1957a) in pointing out that low alpha may 
be indicative of either a high or low level of 
activation, and they interpreted the reduc- 
tion of alpha in their experiment as being 
due to drowsiness or a low level of activa- 
tion. 

Finally, an observation concerning the 
skeletal motor system may be mentioned. 
Tn a study in which he and his wife served 
as alternative S and E, Freeman (1932) 
noted increased quadriceps tonus on days 
following serious sleep loss. 

Hallucinations. Numerous authors have 
mentioned hallucinatory experiences in 
sleep-deprived Ss, but unfortunately the 
number of well described cases in which 
hallucinations definitely occurred is very 
small. With regard to hallucinations Tyler's 
(1947) investigation is unusually detailed. 
Of the total number of 588 5$ studied, 444 
Ss were deprived over 60 hours (with 275 
Ss deprived as much as 112 hours). Some 


of the results of this study were complicated 
by administration of drugs (benzedrine sul- 
phate and barbiturates) and by an ex- 
tremely strenuous program of exercise. 
After stating that in almost all cases the 
psychological changes observed were only 
mild ones, Tyler wrote as follows: 

In a few cases the psychological changes which 
occurred were not so mild. These developed dur- 
ing the first 48 hours of sleeplessness and while 
they were still receiving placebos. The first such 
instance occurred just before dawn of the third 
day during a 112-hour experiment involving 35 
men. This subject became extremely aggressive, 
developed delusions of grandeur (imagined himself 
on secret missions for the President, etc.), started 
unprovoked fights, became unmanageable, and had 
to be forcibly restrained. He was put to bed and 
slept through the entire day. On awakening he 
had no recollection of the events of the previous 
night. An examination of his record disclosed no 
indication of any psychotic tendencies during his 
training period; he was well liked, and considered 
quiet and reserved by his friends. 


Six other such cases occurred during the course 
of these studies: two were characterized by ex- 
treme irritability and aggressiveness requiring re- 
straint; four cases by persistent auditory and 
visual hallucinations of such a nature that it was 
deemed desirable to have the men drop out. As in 
the case described above, none of these six men 
had a previous history of psychotic tendencies and, 
following sleep, all symptoms and all recollection 
of the preceding events disappeared (pp, 258-259). 

As far as we know there has been no 
systematic study of physiological functions 
other than EEG and heart rate in con- 
junction with performance under conditions 
of sleep deprivation. Furthermore, with the 
possible exception of the work of Tyler et al. 
(1947), there have been no investigations 
made of physiological changes through the 
vigil under conditions which force the S to 
suppress sleep and to keep performance 
at or near the level of which he is capable 
in the absence of sleep loss. 

The experiments to be described presently 
were designed to fill this gap. They con- 
tained feedback-of-information features to 
counteract lapses in the S's tracking per- 
formance due to brief periods of sleep. 
Wilkinson's (1958, 1959) experiments show- 
ing the importance of feeding back infor- 
mation to a sleep-deprived S about his 


scores were noted earlier. Although the 


results of Wilkinson's experiments were not 
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known during the design of this study, the 
E communicated with the 5 at the end of 
every trial and informed him what his score 
was on that trial and how that score com- 
pared with his best score up to that time. 
Two other means of informing the 5 con- 
cerning his errors (heaters on his legs that 
approached painful intensity with accumula- 
tion of gross errors, and the intensity of the 
tone in the earphones which was propor- 
tional to the S’s error) were employed to 
further decrease the liability of the S to 
lapses caused by a drift toward sleep. 

To summarize, the main purpose of the 
present investigation was to observe changes 
in physiological and performance measures 
as a function of sleep deprivation under 
conditions designed to be highly alerting, 
viz., with the Ss instructed to guard against 
falling asleep, even for brief intervals, and 
with various techniques involving continu- 
ous feedback of information concerning 
performance employed to keep alertness at 
a high level. 

Data relevant to the matter of changes in 
pain thresholds with sleep deprivation were 
collected in the course of these experiments 
and finally, the relation of body sway to 
sleep deprivation was also studied, 


Метнор 


Subjects 


Three young healthy male Ss were em- 
ployed in the experiments, S, 
28-year-old clinical PSychologist on our 
laboratory staff, 5, a 26-year-old graduate 
student assigned to our laboratory for the 
summer, and $, a 25-year-old third-year 
medical student who had volunteered to 
serve as S after hearing about our experi- 
ments. Prior to his learning sessions on 
tracking before the vigil, this © had not been 
in our laboratory. 5, and S^ had served in 
pilot studies with very long sessions and 
they were highly practiced in tracking. All 
three Ss were well paid for serving, and 
they all assumed their responsibilities with 

quite evident seriousness, 


was a 


Medical examination prior to the experi- 
ment established that all were in excellent 


physical condition and, during the experi- 
ments, the Ss were examined regularly by 
а physician. Except for a mild digestive 
upset in S, on the last day of his vigil 
(which, in the opinion of the physician did 
not require stopping the experiment) the 
experiments were uneventful from the medi- 
cal point of view. S, was given one tablet 
of gelusil and one tablet (50 mg.) of gravol 
( dimenhydrinate) following breakfast after 
which his symptoms of mild nausea dis- 
appeared within an hour. At lunch time 
the same symptoms which were reported 
in the morning (mild nausea and lack of 
interest in food) returned. Approximately 
one-half of his lunch was left. These зутр- 


toms disappeared within one-half hour 
following administration of the same 
medication. 


Physical examination revealed a hearing 
loss in S, which was subsequently investi 
gated more closely because of its importance 
for auditory tracking. From comparisons 
of the S’s lower limen at 1000 cps with 
those of three observers with unimpairec 
hearing, it appeared that the S’s hearing 
loss for this frequency was in the neighbor 
hood of 5 db. To compensate for this 1055. 
the level of the tracking tone was raise 
approximately 5 db, 


Apparatus 
Tracking Apparatus 


With certain modifications, 
this study was i 


: is identical with that previously de- 
rege (Davis, Stennett, & Quilter, 1957). For 
us study, however, the direst cue was 
тн. А , directional cue 


either direction pro- 


duced а 10 the earphones worn 


recording the 5 
ask was a simple 
ary for the $ to turn 
{о the left until he reached 2 
and then let the knob return slowlY 
ainst the other stop. This patter? 
Was repeated cyclically. As a further means © 
being sure that the 5 was constantly alert, he wat 
asked to press his right foot down on a peda 
each time that he came to one of the mechanic? 
Stops. Ву pressing down on the foot pedal the 


| 


> 
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closed a microswitch which caused a pip to appear 
on the polygraph tracing. 

Time-off-target (TOT) was recorded by di- 
rect-reading SET clocks and also graphically by 
means of an integrator with a 4-ѕес. discharge 
rate. The instrument was calibrated such that 1 
sec. off-target produced a deflection equal to 1 em. ; 
TOT was registered whenever the amount of 
error was approximately equal to or greater than 
0.58°. Using the same kind of integrator, distance- 
off-target (DOT) in fractions of a degree was 
also recorded graphically, The amplitude of de- 
flection per degree off-target was controlled by 
attenuators in the error-recording circuits. The 
attenuator settings were varied in accordance with 
the S’s level of performance. From calibrations, 
special scales were constructed for measurement 
of the DOT tracings. 


Physiological Recording Apparatus 


For recording physiological tracings and error 
data, three chart drives with ink-writing gal- 
vanometers were used. Chart drive по. 1 was 
part of a custom-built Edin EEG-E IG which 
recorded the primary tracings of F and clec- 
tromyograms (ЕМСѕ) (paper speed: 25 mm, рег 
sec). Electronic integrators, employing the con- 
densor charge-discharge principle and connected in 
parallel with the galvanometers of this EEG-E 1G 
unit, summated the muscle potentials over succes- 
sive 4-sec, periods. The spike tracings from these 
integrators were recorded on chart drive no. 2 
(paper speed: 1 mm. per ѕес.). High-pass filt 
designed to exclude frequencies below 20 eps were 
used for EMG integrations (where Ше modal 
frequency is of the order of 40 to 50 cps). For 
quantification of the EEG data two band-pass 
filters with fairly sharp cut-offs at the upper and 
lower limits of the 2-4 cps and 8-12 eps frequency 
bands were used in connection with integrators, 

Chart drive по. 3 was used to record heart 
rate, respiration, systolic blood pressure, palmar 
conductance, DOT, and TOT. Heart rate was 
recorded by means of a modified Edin electro- 
cardiotachograph (ECTG), and respiration was 
recorded with a Phipps and Bird pneumograph and 
recording tambour. An attempt was made to 
record systolic blood pressure continuously by 
means of an instrument copied with certain modi- 
fications from the design of Burns, iott, and 
James (1955), Unfortunately, however, the trac- 
ings from Sa were not satisfactory and after Ses- 
sion 6, they were discontinued in the case of this 5 
Method of recording and measuring palmar con- 
ductance was similar to that described by Stennett 


(19570). 


3 Some important details of the apparatus for 
recording palmar conductance and some advances 
in recording technique are described in an wn- 
ished paper by Malmo and Davis. 


publ 


un 


Bipolar sponge electrodes, soaked in electrode 
jelly and attached to the 5 by lastonet bands, were 
used for all EMG and EEG placements. Lead 
placements for Sı and S: were as follows (EMGs 
{тот neck and biceps muscles were omitted for 
х): For the forehead-muscle recording, electrodes 
were placed 2 in. from the midline (4 in. from each 
other) and approximately 1 in. above the eyebrow. 
In recording from Ше muscles on the left side of 
the neck, the second cervical spine (found Бу 
counting from the prominent seventh cervical 
spine) was located and marked. A point 13 in. 
outward {тот the second cervical spine (perpen- 
dicular to the backbone) was marked for the 
placement of the first electrode. The second 
electrode was placed 12 in. downward from the 
first electrode and § in. to the left of the midline. 

For recording from the pronator teres muscle, a 
line was first constructed across the ventral surface 
of the arm from medial epicondyle to lateral 
epicondyle. The center of the first electrode was 
placed 1 in. medial to the midpoint between the 
two bony landmarks. The second electrode, its 
center 2 in. from the first, was the vertex of an 
equilateral triangle whose base lay along the elbow 
crease and whose vertex in the longitudinal plane 
lay along the midline of the arm (approximately 
13 in. directly below the cubital fossa). In this 
study, recordings were taken from the left (“іпас- 
tive") arm as well as from the right arm that 
was engaged in tracking. 

In placing the first right-biceps lead, a point 
was marked one half of the distance from the 
anterior fold of the axilla to the cubital fossa, and 
the center of the first electrode was placed 1 in. 
proximally from this point (on the line between 
the two landmarks). The center of the second 
electrode was placed on the same line 2 in. distally 
from the center of the first electrode. 

In placing the first electrode for the left leg, the 
distance from the head of the fibula to the lateral 
malleolus was measured, and the point which 
bisected this line was marked. The center of the 
first electrode was placed at this point. The center 
of the second electrode was placed on the same 
line, 2 in. proximal to the first. 

Placements of all muscle leads except pronator 
teres are illustrated in a mimeographed manual by 
Davis (1952), and other technical points are dealt 
with in this manual. For EEG recording, bipolar 
sponge electrodes soaked in electrode jelly were 
attached to the $ by lastonet tie bands, and these 
electrodes were placed two thirds of the distance 
from nasion to inion, and 3 cm. from the midline 
on each side. ECTGs were recorded by means of 
electrodes placed on contralateral limbs. 


Heat-Pain Stimulation as a Means 
of Increasing the Subject’s Motivation 


In order to increase the S's incentive to track 
well throughout the vigil, heater coils were con- 
nected to the legs which in turn were wired into 
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a circuit activated by the error relay. Whenever 
the error score became disproportionately high, 
enough heat was generated by these coils to be- 
come actually painful. A detailed description of 
this apparatus has previously been published (Sur- 
willo, 1958). 


Other Apparatus 


During part of the time that the S was not 
tracking, pain-threshold determinations were made 
and measurements of postural sway were recorded. 
The apparatuses employed in these two procedures 
were as follows: 

Modified Hardy-Wolff Pain-Threshold Appara- 
tus. Pain-threshold measurements were made with 
laboratory-constructed apparatus essentially the 
same as that used by Andrews (1943). The 
standard 3-sec. period of stimulation was provided 
by a shutter controlled in turn by a Hunter timer 
which was specially calibrated in our laboratory. 

А 500-w., 120-v, Westinghouse T20 clear projec- 
tion lamp (C13 construction) with medium, 
prefocused base, supplied the stimulus. This lamp 
was operated in the base-down position from the 
AC line through a voltage stabilizer and con- 
trolled by a Variac transformer, Each stimulus 
was measured by a wattmeter connected in the 

lamp circuit. 

The lamp was mounted inside a bullet-shaped 
metal housing with an aperture of 1j-in. diameter 
in the nose. Two Bausch and Lomb lenses, 165.1 
mm. in diameter, 31.9 mm. in thickness, and with 

2540-mm. (single lens) focus were combined (flat 
suríaces outward), and placed between the lamp 
and the aperture so that the focal length of the 
combined lenses was approximately 6 in. Two 
rounded plastic stops, 1 im. long and à in. in 
diameter at their ends, protruded from the nose. 

A very large area of the S's forehead was 
blackened with Higgins drawing ink and allowed 
to dry well. After he had been informed concern- 
ing the threshold-taking procedure, the S placed 
his forehead against the Stops in such a way that 
the focused light would fall on the blackened por- 
tion of his forehead. The E turned the wattage 
down to a low value and closed a switch which 
activated the Hunter timer. The timer caused the 
shutter to open and close allowing the light to 
fall on the S’s forehead for exactly 3 sec. For 
the first stimulation the wattage selected was one 
which almost always elicited the response “warm” 
from the S. Following his verbal response, the 5 
withdrew his head from the Stops while the Е 
increased the Variac setting to a point where the 
wattmeter read 20 w. higher than on the previous 
stimulation. After 15 to 20 sec. the $ was asked 
to place his forehead against the stops again, shift- 
ing the position slightly from where it had been on 
the previous stimulation. When the $ was in posi- 

tion and had stated that he was ready, the Е 

presented the second stimulus for 3 sec. and re- 


corded the S's response. This procedure was 
repeated with the S returning his forehead es 
the stops each time so that the area of ==: 
stimulated was not the same on two successive 
occasions, and with the E advancing the inten 
by 20 w. from stimulus to stimulus until the pes, 
response of pain was elicited from the Sa It i 
previously been explained to the S that the р “A 
cedure was by no means a test of his ability р 
endure pain, but rather was a means of ee 
ing what intensity of stimulation was соне Ў 
him to perceive the first trace of pain. d 
also explained that it was especially important : 
the 5 to concentrate during the latter part o 
stimulation because the slight pricking or sing ae 
feeling, signifying that the pain threshold, Se 
been reached, appeared at the very end o ЫШ 
3-зес. stimulation period. At least two и | 
determinations were taken at each sitting but uo 
determinations were invariably made unless the 
first two threshold values were extremely close. 


All Ss were given practice on the threshold- 
taking procedure and had thresholds taken ale 
to the vigil During the vigil, thresholds bes 
taken during the midafternoon of the first, secon n 
and third days, and early in the morning (ерисе 
5:00 a.m. and 6:30 л.м.) of the second and thir 
days. Pain thresholds were taken following sleep 
after the vigil in all cases, In consideration © 
the S's convenience, however, it was possible to 
test only 5. on the same morning after his sleep 
following the vigil. у 

Ataxiameter, This apparatus was ап adaptatio” 
of Hull's (1933) Postural Movement Recorder: k 
clip with thread attached was fastened to the back 
of the S's collar. This thread was led throug s 
a protective metal tubing on to a pulley and thena 
attached to an upright metal pointer whose ba S 
was held in a metal track. Next, the thread Ee 
led over a second pulley whence it ran vertical P 


A weight was attached to this end to keep thé 
thread taut. When 


pointer moved 
leaned backwa: 
The 5 was posi 
rested midwa 


were fitted into the track, 
red one. The blue pointe 
the maximum excursions of the red pointer (maxi- 
mum forward and backward movement of the 5). 
е The 5 stood with eyes closed and with feet in 
V" position, at an angle of 30° with each other. 
After being instructed to keep his body as still a5 
possible е 5 stood for 2 min., rested for 30 sec« 
and then stood for 2 min, again. The scale loca” 
tion of the red pointer was recorded every 10 sec 
together with the maximum left and right (blue 
pointer) readings at the end of each 2-min, perio 


rS were used to record 


у 


CHANGES DURING SLEEP DEPRIVATION 


TABLE 1 


SCHEDULE OF TRACKING SESSIONS DURING VIGILS 


Session No. Time Scheduled Day 

1 11 A.M. Tuesday 
2 3PM. 

3 9 р.м. 

4 3 A.M. Wednesday 
5 11 A.M. 

6 3 P.M. 

7 9 р.м. 

8 е 3 A.M. Thursday 
9 11 A.M. 

10 3 PM. 


Ataxiametric sessions were scheduled two hours 
apart throughout the vigils, and except for some 
occasions when delays occurred, these sessions were 
kept fairly close to schedule. 


Procedures 


From the S's point of view the vigils were 
structured mainly by the 10 l-hour tracking ses- 
sions (see Table 1) and by meals. Five-min, track- 
ing trials were employed, with 1-min. rest intervals 
between trials. There were 10 such trials in each 
experimental session. After each trial the Æ in- 
formed the S what his score was on that trial and 
how that score compared with his best score up 
to that time. In the case of S: and Ss, at the end 
of every session, full recordings were made during 
a 5-min. rest period in which the 5 remained seated 
in the chair under careful observation to make 
sure that he remained awake all through the period. 
In drawing up the master plan for all activities 
during the vigil, two hours prior to the times 
shown in Table 1 were allowed for precalibrations, 
the attachment of electrodes and other pickups to 
the 5, and presession trial of all recordings. One 
hour was the minimum time for a session, but it 
was expected that most sessions would run slightly 
longer due to occasional stops to adjust or change 
faulty electrodes, or to make minor adjustments 
in the apparatus to ensure first-class recordings. 
А whole hour after the experimental session was 
set aside for removal of electrodes and attach- 
ments from the S, and for postcalibrations. Сег- 
tain electrodes that could be worn for long periods 
without discomfort to the $ were left on. Despite 
the large number of measures recorded it was 
possible to keep rather close to this schedule. 

Meals from the institute's Diet Kitchen * were 
served to the S in the laboratory room. Breakfast 


E Р 
ате is a pleasure to record our gratitude to the 


dieticians and to other personnel of the Diet 


Kitchen of the Allan Memorial Institute for their 


important contribution to this study, 


S 


was at 8:00 А.м., lunch at 12:30 р.м. and supper 
at 5:30 р.м. In addition to regular meals, morning 
coffee was served at 10:00 А.м. and light snacks 
were served at 2:00 р.м. and 10:00 р.м. The 5 
remained in the same laboratory room throughout 
the vigil except for trips to the washroom and to 
a room adjacent to the laboratory to make an 
occasional telephone call. The experimental room 
was windowless and in order to prevent the S 
from perceiving the changes from daylight to 
darkness outside, he was blindfolded and led by 
one of the laboratory assistants whenever he went 
to the other two rooms, No attempt was made to 
conceal the time of day from the S; the purpose 
of the blindfold was merely to avoid giving the S 
opportunity to perceive changes in illumination 
from the sky. It has occasionally been reported 
that the subjective feeling of fatigue during a vigil 
is lessened at daybreak when the S perceives the 
sky becoming light. 

Regularly following each tracking session two 
graphic rating scales were administered. In each 
instance the S was requested to place a vertical 
line somewhere along an unbroken horizontal line 
that was 20 cm. in length, The question guiding 
the rating of the first scale was, “How much effort 
did you have to put in the task in order to do as 
well as you could?” The guide phrases "consider- 
able amount of extra effort,” and “normal amount 
of effort” appeared at opposite ends of the hori- 
zontal line. 

In the case of the second scale the question was, 
“How do you feel now?” Five guide phrases 
evenly spaced along the horizontal line were as 
follows: “very tired,” “tired,” “as usual,” “fresh,” 
and “very fresh.” The order of the guide phrases 
from left to right was reversed on alternate presen- 
tations of each scale. 

Soon after supper each evening and between 
midnight and 1:00 a.m. on Wednesday and Thurs- 
day, the S took a variety of tests administered by 
clinical psychologists. The purpose of these tests 
was mainly to keep the S actively engaged in order 
to prevent sleep. Between these planned activities 
and procedures the S was free to read, play games, 
or do whatever he pleased (5з spent much of 
his free time playing his guitar). The S was under 
constant observation by one member of the labo- 
ratory or another, and care was taken to have at 
least one well-rested laboratory assistant on duty 
at all times. At no time was the 5 left alone, even 
for a few minutes. 


Treatment of Data 
EEG 


The EEG tracings recorded on one channel of the 
Edin instrument were used for monitoring pur- 
poses. Band-pass filters and integrators were em- 
ployed to quantify the EEG tracings in the two 
irequency ranges of approximately 2-4 cps and 
8-12 cps. The integrated data were separately 
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recorded as deflections every 4 sec. (0.1 cm/sec 
paper speed). 

Using all of the integrator deflections, the mean 
voltage per min. was determined by accumulating 
the number of millimeters in each successive series 
of 15 deflections dividing by the number of deflec- 
tions (usually 15) and by reference to the specially 
calibrated ruler for the gain used in recording, 
reading off the corresponding microvolt value. 

For each frequency range (2-4 cps and 8-12 
cps) five mean l-min. microvolt values were ob- 
tained from each tracking trial and 50 such values 
were obtained from one entire tracking session, 
providing no deflections were monitored out. The 
original tracing was carefully examined in order 
to ensure that no spike was measured if it covered 
а part of the tracing in which some artifact oc- 
curred, Where available (in the case of Sz and 
Sa) the data for the 5-min. rest periods at the 
end of the sessions were treated in the same w: 
as the data for the tracking trials. 

Following measurement of the EEG integrations, 
the separate measurements were combined т 
various ways, The 250 l-min. mean microvolt 
values from the tracking trials in the first five 
tracking sessions were combined to yield a mean 
for the first half of the vigil, and similarly the 
mean microvolt value for tracking sessions in the 
second half of the vigil was also obtained, Mean 
microvolt values were also computed separately 

for each tracking session, for each 5-min, tracking 
trial, and for each 5-min. rest period, 


EMG 


„With the 
right arm, every sixth deflecti 


ay 


, Such that 12 
all were measured for every trial. 
For muscles of the right arm, in order to keep 
the task-induced tension approximately the same 
from measurement to measurement, the following 
deflections were measured: Numbers 4, 12, 19, 27 
34, 42, 49, 57, 64, and 72. MEA 

As in the case of EEG and other phy: 
measures, the main comparison was betw 
values from the first half of the vigil 
values from the second half. 


siological 
een mean 
with mean 


Palmar Conductance 


Measurements of palmar resistance were taken 
regularly every 15 sec. throughout each trial, 
commencing at the very beginning of the trial, so 
that there were 21 measurements per trial. Resist- 
ance values were immediately converted to con- 
ductance in micromhos with a table of reciprocals. 
All calculations and statistics were performed with 


conductance values as recommended hy Woodworth 
and Schlosberg (1954, р. 140). 


Respiration 


Number of respirations counted to the nearest 
quarter respiration was obtained for the first 23 


min, of each trial, for the last 24 min., and for the 
whole trial. 


Heart Rate 


Heart rates were determined from the ECTG 
tracing at 15-ѕес. intervals, these measures сот- 
ciding temporally with the measurements of palmar 
conductance. 


Tracking Error Scores 


TOT. The clock readings showed total TOT for 
each whole trial. Since the 4-sec. graphic integra- 
tions of TOT were available for fractionations in- 
volving parts of trials each deflection was measured 
with a rule calibrated directly in sec. Both TOT 
values were treated to yield a mean TOT score per 
trial for the first half of the vigil versus a mean 
TOT score per trial for the second half of the vigil. 
TOT scores were also obtained for the first half 
and last half of each trial. | 
DOT. Each 4-sec. integrator deflection was meas- 
ured with a rule calibrated in degrees. Mean РОТ 
scores for the first half and second half of the 
vigil were determined. DOT scores were also 


obtained for the first half and last half of each 
trial. 


Palmar Sweating 


Average Micromhos 


EEG Alpha Amplitude 


Average Microvolts 


2 3 4 5 6 7 8 9 10 
Session Number 


Fic. 1. Mean levels of 8-12 cps EEG and 
palmar conductance throughout vigils lasting ар- 
proximately 60 hours. (Each point on curves 
represents mean value for 1-hour tracking session, 
combining the values for all three Уз. Approx!- 
mate time corresponding to each session number 
is given in Table 13 
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RESULTS 


EEG, Palmar Conductance, 
and Respiration 


Progressive EEG Changes during the 
Vigil in Relation to Concomitant Changes in 
Palmar Conductance and Respiration. The 
lower graph in Fig. 1 presents data for 
EEG alpha quantified by means of the 8-12 
cps band-pass filter. The upper graph in 
Fig. 1 presents data from measurements of 
palmar conductance. Each point on the 
curves represents the mean value for one 
entire tracking session. Figure 2 presents 
the corresponding data from respiration 
rate. Table 2 presents a comparison between 
mean values for the first half of the vigil 
versus mean values for the second half in 
the case of each S, and the p values for the 
differences obtained. 

Table 2 indicates that all three of these 
physiological measures showed reliable 
change for each S during the vigil, with 
palmar conductance and respiration higher 
during the last half of the vigil and ampli- 
tude of 8-12 cps EEG lower during the last 
half as compared with the first half of the 
vigil. 

EEG data quantified by means of the 2-4 
cps band-pass filter also yielded significant 
changes for all three Ss from first to last 
halves of vigil. Sı and Ss showed drops in 
amplitude from 5.3 to 3.9 (p < .001) and 
from 3.3 to 3.0 (р < .01) mean microvolts, 
respectively, while S's showed a rise from 
3.7 to 4.0 mean microvolts (р < .001). 


210 


m" 
ө 
a 


Meon Respirations Per Minute 
© 


1 2 3 4 5 6 7 8 9 10 
Session Number 


Fic. 2. Mean rates of respiration throughout the 
vigils. (Points on curves were obtained similarly 
to those in Fig. 1.) 


Comparison of Within-Trial Changes: 4 
Search for Lowered 8-12 cps EEG (“Al- 
pha") Associated with Possible Drowsiness 
and Shift toward Sleep. Accompanied by a 
concomitant rise in palmar conductance and 
respiration, the finding of a progressive fall 
in 8-12 cps EEG activity is strong evidence 
that our Ss were very much awake and alert 
throughout the tracking sessions (Stennett, 
1957a). However, to investigate this matter 
more closely, the data were subjected to 
further analysis. 

An inspection of the 8-12 cps EEG data 
revealed that amplitude tended to fall during 
the trial, and this tendency seemed more 
marked in the second half of the vigil than 
in the first. We present the following 
analysis by Ss. 

S,: The mean difference between first 
and second halves of trials for the first five 
tracking sessions in the vigil was —0.16 uv. 


TABLE 2 


CHANGES IN EEG 8-12 cps AMPLITUDE, PALMAR CONDUCTANCE, 


AND RESPIRATION RATE DURING VIGIL 


5 So Ss 

Ist 2nd 1st 2nd Ist 2nd 

Half | Half Half | Half Half | Half 

of of of of of of 

Vigil | Vigil | Diff. p Vigil | Vigil | Diff. p Vigil | Vigil | Diff. р 
ЭЛ2 EEG (иу) | 12.4 | 9.3 |—3.1 |<.001 6.8 | 5.1 |-1.7 |<.001 91| 1.0 -2.1 |21 
PC (и mhos.) 19.6 | 64.7 | 15.1 |<.001 51.9 | 81.2 | 29.3 |<.001113.2 [139.1 25.9 |<.001 

19.16, 19.57 0.41<.05 | 21.34 6 0.84 < .0011 19.46, 19.83 0.37 <.02 


Resp. (рег min.) | 
| 
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: г i" үбү veen 
whereas the corresponding mean difference sponsible for tie apparent par Mes 
for the last five tracking sessions was 1.64 the data from шарана а 
LY. The minus value for the first five ses- palmar conductance. on 
sions of the vigil indicated a mean rise in $i: This. S also — eem bn m 
amplitude from the first to second halves trial fall of 8-12 cps E dee were 0.82 
of tracking trials. The difference between in the vigil. The ditteretion ps sheer 
70.16 and 1.64 ру. was a highly significant and 1.76 and the difference | e Y Е level 
one (р < .001). Био, маца! pps см im nor 

The data for palmar conductance and However, neither aec ор 
respiration were treated in a similar way respiration yielded est 
as those for EEG. The difference values When treated in a sinnmar " m — 75 
for palmar conductance were —1.08 and Sa: Data from this S fai а posed 
—0.28 for the first five sessions and the last nificant EEG ү w hen нв e 
five sessions, respectively. As is well known, іп the way poe I “ Баш ome differ- 
palmar conductance characteristically falls (ТОТ and DOT) nis $ 8 For TOT the 
during a trial of almost any kind of activity, ences when treated in this я 1 4-187 for 
The data indicated, however, that the fall (difference values were —3.61 anc А 
was significantly less (р < 05) in the latter 
part of the vigil than it was in the first part of 
the vigil, Corresponding difference values for 
respiration rate were 0.51 and -0.29 for the 
first five and last five Sessions, respectively. 
This difference was reliable at the .01 level 


BILATERAL PARIETAL EEG 


respiration in the latter halves of the track- 


finding was 
mar conduct- 
ance, the respiration tracings were studied. 
an irregularity 
which seemed 


breaths which had the effect of 
respiration rate. In the latter part of the 
vigil this S took more deep breaths than in 
the early part of the vigil. Thus in the first 
three sessions there were only 4 large respi- Е 
rations (two times the height of adjacent 
respirations and at least 2 cm. wide at the 
base) compared with 42 such deviating 


? ЕРЕВАН 
Y F ЧЕН + dE 

respirations observed in the last three ses- | | Ой 

sions. The breakdown for first and Second НИ ыы ышы ишн 

halves of these trials lies 0 and 4, 11 and Fic. 3. Apparent disappearance of EEG alpha 

31, for the two fractionations (Sessions 1-3 


А from tlie primary EEG tracing during brief episode 
and 8-10), respectively. Tt would appear, of sleep in a young male 5 during tracking in an 


therefore, that the larger number of these unpublished study by Malmo. (Search for similar 
leviating respirations in the latter half of phenomenon in the records of the present investi- 
( 


> 5 e haya] gation failed, The arrow in this figure and those 
trials late in the vigil may have been re- in Fig. 4 indicate the Same moment in time.) 
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first and last parts of vigil, respectively (the 
plus score of 1.87 indicating a predominance 
of increasing error in the second half of 
trials), and the difference of 5.48 between 
these two values was highly significant 
(.001 level). For DOT the corresponding 
values were —0.009 and 0.003 and the dif- 
ference of 0.012 between these two values 
was significant at the .05 level. Error scores 
from the other two Ss were treated in this 
fashion but failed to yield reliable differ- 
ences in either case. 

Finally, it should be mentioned that a 
similar analysis was carried out on the 
heart rate data from the three Ss but failed 
to yield any significant differences. 

Tracings Illustrating Transition from 
Waking to Sleeping State. Finally, а careful 
search of the tracings was made for the 
purpose of locating instances of EEG 
changes characteristic of the kind which 
occur during a brief episode of sleep. This 
search did not reveal a single such instance. 

In order to illustrate precisely what we 
were searching for, some tracings from an 
unpublished experiment by Malmo are re- 
produced in Fig. 3 and 4. These tracings 
were cut from Trial 9 of a series of tracking 
trials designed to bring the S to a practice 
evel. The EEG lead placements were the 
same as those in the present study and the 
tracking apparatus was similar to the one 
described in the Methods section. As these 
tracings were being recorded, the E sus- 
pected from the pattern of changes occur- 
ring that the S had fallen asleep briefly 
during Trial 9. After the trial had been 
completed the 5 admitted upon questioning 
hat he had indeed “dozed off.” 

Inspection of the record in Fig. 3 reveals 
hat for a period of approximately 20 sec. 
there was virtual disappearance of the high- 
amplitude rhythmical waves with frequency 
close to 10 cps. This change is also seen 
in the integrator deflections in the upper 
part of Fig. 4 (second line). The deflections 
for the 17-30 cps component of the EEG 
(a band-pass filter for quantification of this 
component—Ross & Davis, 1958—was in- 


cluded in this study) also show a drop in 


INTEGRATED EEG 


INTEGRATED ERROR 


hu 11 sec. 


Fic, 4. Further data obtained from the same 5 
as the one whose tracings are shown in Fig. 3. 
(The upper part of the figure shows the secondary 
EEG tracing consisting of the 4-sec. integrated 
outputs of three band-pass filters with the follow- 
ing frequency bands, from top to bottom: 2-4 cps, 
8-12 cps, and 17-30 cps. The lower part of the 
record shows the sudden increase in error during 
the period when the S was dozing. Top line: 4-sec. 
integrations of TOT. Bottom line: 4-sec. integra- 
tions of DOT. The two arrows in this figure and 
the one in Fig. 3 indicate the same moment in 


time.) 


amplitude (third line), while the 2-4 cps 
component appears to show no change. The 
integrator deflections for TOT (fourth 
line) and for DOT (fifth line) give evi- 
dence of a sudden increase in error that 
corresponds rather closely with the drop in 
amplitude of the EEG deflections. Because 
of the phase of the tracking cycle where the 
onset of sleep occurred, nothing should be 
made of the fact that the fall in EEG ampli- 
tude precedes the abrupt increase in error 
by several seconds. Heart rate (not in- 
cluded in the figure) showed a correspond- 
ing decrease during this brief period of 
sleep, followed by a risc. 


Heart Rate 


Mean heart rate for Sı and S, was re- 
liably higher (.001 level) in the second half 
of the vigil compared with the mean value 
for the first half of the vigil. The rises 
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TABLE 3 


CHANGES IN MUSCLE TENSION DURING VIGIL 


A 


Уз Ss 


Mean Microvolts 


Mean Microvolts Mean Microvolts 


Ist 2nd Ist 

Half | Half Half 

of of of. 

Vigil ^ Vigil | Diff. b Vigil 

Forehead 18.5 | 17.9 | —0.6 us 18.8 

Left neck 8.9 | 14.9 6.0 <.001 41.0 

Right biceps 42.6 | 29.9 |-12.7] ns 26 3 

L. pronator teres 11.0 5.9! —5.1<.001. 10.5 

R. pronator teres 60.7 61.8 l.1| as 76.9 
Left leg 6.7 [a 0.5 ns 6.6 | 


————— $e فق‎ 
* Plus differences signify ri 
vigil. 


were from 90.5 bea 
beats per minute 
86.7 for Sy. S, 


drop from a mean heart rate of 84.0 beats 


minute for the second half 
The difference met the .05 
ce, 


EMG 


Specificity of Tensional Increase dur 
the Vigil. Table 3 Presents the muscle t 
sion data fractionated for 
halves of vigil. A differen 
minus sign Tepresents a f. 
sion from the first to the 
vigil. 


ing 
сп- 
first and. second 
ce prefixed by а 
all in muscle ten- 
second half of the 
"pretation of the 
ampered by the absence 


veals an interesting 
pattern in the results, Aq] Ss showed a 
significant increase of tension j 

muscle. The particular musc 
varied from S to S: Sı sho 
tension in the neck muscles, S, in the right 
biceps, and S, in the forehead muscle, 

The neck muscle data from S are plotted 
in Tig. 5. Tt will be seen that the steepness 
of EMG gradient as well ах the level of 
muscle increased 


wed increased 


tension progressively 


se, minus differences signify f. 


l dies | ag 
2nd Ist 2nc 
| Half Half | Half | 
of of | of. "m 
Vigil | Diff p Vigil | Vigil | Dif LE. 
Sloe) 138 1 
1.0; —78<.001 9.3 | 10.8! 1.5 ig 
48.6 7.6 ns | | p 
33.8 7.5 «.01 | > 
4.3 | —6.2,«.001| 12,4| 8.7 [7347 s 
| 41.9 —35.0| <.001| 63.2 | 64.4 | 1.2 E 
5.9 —0.7 ns 29 |. 87 | 08 


H o . ч “Ты half of 
all in muscle tension from first to second he 


throughout the vigil. As would be eX 
pected, the dotted line 
( high-incentive) trial 
vigil has a less regular 
other curves; 


curve for the single 
at the end of the 
appearance than pe 
but it is of interest to eh 
that although this curve has a steeper siom 
than the other curves, the level of must 

tension appearing at the end of this m 
was approximately the same as the de 
value of tension in the p 


curve based on 1 
values from the last tl 


aree sessions. 
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(Data are plotted to permit visua Phirds 
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бт. the 
of the vigil anq for final trial at the end of 
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First Four Sessions 


AVERAGE MICROVOLTS 


TIME IN MINUTES 


Fic. 6. Mean fall in left arm tension during the 
vigil. (Data are plotted in the same way as in 
Fig. 5, but data for all three 5$ are included.) 


Progressive Fall in Left Pronator Teres 
Muscle Tension during the Vigil. Tigure 6 
presents curves for left pronator teres 
ЕМС. These curves were plotted in the 
same manner as those shown in Fig. 5, but 
in this instance the mean muscle potential 
values plotted were taken from all three S's 
As in Fig. 5 the curves show a progressive 
change in gradient steepness and in muscle 
tension level throughout the vigil, but the 
direction of change is exactly opposite to 
that shown in Fig. 5. There was continuous 
fall in level of tension and in gradient slopes 
for the left pronator teres muscle through- 
out the vigil. It should be noted that on the 
special incentive trial at the end of the 
vigil, mean tension and gradient steepness 
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agreed very well with the corresponding 
mean values for the first four sessions. 
That the decrease of tension in this muscle 
was reliable in the case of each S may be 
seen by reference to Table 3. i 


Performance Data 


Table 4 presents the data on tracking 
performance for each S. Amount of error 
for the first half of the vigil is compared 
with the amount of error for the last half. 
It is clear from Table 4 that the three Ss 
were rather different with respect to the 
effects of the vigil upon their performances. 
Sı showed clear impairment; the increases 
in TOT and DOT were both highly signifi- 
cant. On the other hand, 5. showed ап 
improvement in his TOT score that was 
also highly reliable. This S’s DOT score 
showed no change, however. Finally, S;'s 
change in TOT was not significant although 
his rise in mean error score in degrees met 
the .05 criterion of reliability. In summary, 
S, showed definite impairment, 5, only 
slight impairment, while Ss actually showed 
improvement in performance during the 
second half of the vigil compared with per- 
formance during the first half. 


Subjective Reports Concerning 
Tracking Sessions 


Hallucinatory Activity. At the end of 
the last (tenth) tracking session, in his own 


TABLE 4 


CHANGES IN TRACKING PERFORMANCE DURING THE V. 


| 
5; | 5: | Ss 
_ | | 
| | | | | | | | 
|| | : | 
Ist 2nd | | Ist | 2nd | Ist 2nd | 
Half | Half | | Half | Half Half | Half 
of of | of of | | of 
Vigil | Vigil | Diff. | p | Vigil | Vigil | Dif. р Vigil | Diff. | p 
Mean TOT | | | | 
(sec./trial) 64.8 [120.8 | 56.0 |<.001 98.2 | 86.9 |—11.3 <.001109.4 LE l= | ж 
Меап РОТ | | | | | 
(degrees) 0.18 0.31 0.13 <.001 0.16 0.16 0.00 — 0.25 0.27 0.02<.05 


a Plus differences signify rise, minus differences signify 


il. 


second half of vig 


fall in error scores (improvement) from first to 
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handwriting S, wrote the following account 
of hallucinatory activity. 
[I noticed] a funny effect оЁ [the] tone in [the] 
last session. Often I would completely forget the 
task, and the tones would become part of some- 
thing like a dream. Different discrete tones (from 
switching the knob around) became people! And 
there were patterns of them: rows and squares, 
which were interesting but annoying, since I wanted 
to keep attention only on the tracking. 
On the next day following a good night's 
sleep in the laboratory, the S added to his 
account as follows: 
I would have the feeling of moving, making my 
way among the "people" at this time, with no 
auditory sensations at all. I think during these 
times I was still moving the knob because Dave 
didn't say anything to wake me up. 
As a psychologist who realized that under 
unusual conditions (e.g., perceptual isola- 
tion) hallucinations could appear in non- 
psychotic individuals, S, was able to view 
these experiences rather objectively, and 
hence did not seem much disturbed by them. 
S; did not have any visual hallucinations 
but he did report a loss of ability to appre- 
ciate the true shape of the knob by tactual 
and kinesthetic means. During one trial he 
just "blinked" at it to get a glimpse of the 
knob. He said that it was disturbing not to 
be getting sufficient information from kin- 
esthetic and cutaneous senses. This loss of 


cutaneous appreciation of the kn 


ob appar- 
ently 


occurred mainly during Session 5 

when the rotational action of the knob had 
become somewhat stiff. It seemed to the $ 
that during this time he exerted considerably 
more pressure on the knob than at other 
times. 

S, reported hallucinations which were 
inanimate, and were a materialization of a 
visual schema that he had been employing 
as an aid in tracking. He visualized a 
rectangle which represented the error-free 
zone in tracking. Inside this rectangle he 
visualized a black dot which represented the 
position of the brush that he could move 
by means of the knob that he held in his 
right hand. In his mind's eye he visualized 
the rectangle as moving from right to left 
as tracking commenced, and his self- 
instructions were to turn the knob in such 


a way as to keep the dot within the rec- 
tangle. 

Tn the practice trials when he was iens 
ing to track, the mechanism of the tracking 
apparatus was explained to him by € 
of a diagram, and being "visual minde 
(the 5 customarily used visual aids in study- 
ing for examinations), he projected the 
visual image as if seen on a prece of paper. 
Although these images appeared to pava 
clarity verging on the eidetic they a 
in certain important respects from the 
hallucinatory images that occurred in the 
later sessions. Unlike the hallucinations, 
the visual images used as guides had definite 
form and they were completely under con- 
trol. 


i F ry 
The transition from the stage of image ч 
used as visual aid to hallucinations d 
d ns 

smooth one. Apparently the hallucination 
first appeared briefly in Session 8 but e 
not develop fully until Session 9. The d 
began to lose control of his imagery e 
the definite form of the images was sep 
by undulating, wavy, irregular borders wit : 
out sharp angles. The hallucinated Даш 
were two-dimensional ones, projected a 
distance from the eyes as if on a ея 
screen. In his notes the S wrote as follows: 
eq А 4 1а 
The test [Session 9] began in much the bo 
way, but during trial no. 3 ог no. 4 the V їй 
image I have of the tone and which I Ne 
tracking was replaced by indistinct hallucinatio!!* 3 
halves of large objects—and the tracking test ? : 
came a test of outlining these objects. I was rathe 
Surprised to note that my score on the first suc? 
test, trial по. 3 T think, was as good as it was, 


_ or as [the E] said it was, Deep respiration seemed 


to overcome or help me to dispel these ment? 


imageries, and it (deep breathing) had to be main- 
tained voluntarily and with considerable concen- 
tration in order to permit me to visualize the test 
pattern as I have been used to. 

Further questioning revealed that for а 
session or two the S was preoccupied with 
what he thought was a connection between 
the hallucinations and breathing. This even 
developed for a brief time into a delusion 
that the hallucinations were being induce 
by the Es. He thought that he noted some 
connection between the disappearance of thé 
hallucinations and "disturbances" in tP 
control room where the Es were operating 
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the equipment. By "disturbances" the 5 
meant talking. The S had the feeling that 
his deep breathing was undesirable from the 
point of view of the Es (because of the 
artifacts produced in the physiological re- 
cordings), but he stated in his notes that he 
continued to breathe deeply despite this. 
Careful examination of the respiratory 
tracings revealed very little actual deep 
breathing during tracking trials, although 
increased amplitudes of the respiratory 
racings were noted in the rest periods be- 
tween trials їп Session 9, and later in this 
Session the pattern of respiration which had 
previously been very regular became irregu- 
ar. It is likely, therefore, that the S 
exaggerated the amount of deep breathing 
hat he engaged in. 

Again from the S's notes there were the 
following statements made immediately fol- 
owing Session 9 when the hallucinations 
were at their peak: 
mring the tracking there were gross tremors 
which later became smaller to fine, and which are 
only now disappearing; also there was excess 
salivation. 

After the final session (10) the 5 wrote 


as follows: 


| put maximum effort int 
alone, їп an effort to rec 
lucinations, and I very 
consciously to project а 
I saw only а few hallucinations which were similar 
ibed above; when they occurred they 
were transitory and my surprise or fear of seeing 
them was changed as a result of the [explanation 
of how hallucinations may be produced in normal 
individuals under unusual circumstances]. 


The $ emphasized that he seemed to be able 
to ignore the hallucinations in this session 
by concentrating on the sound only. 

Tt is interesting to note that roughly six 
weeks later when this S was interviewed 
again there appeared to be some confusion 
in his mind about the details of the visual 
schema that he had used during some nine 
hours of tracking during the vigil. 

Rating Scales. Figure 7 presents the data 
[rom the rating scales for effort (“How 
h effort did you have to put in the task 
rder to do as well as you could 2") and 
ig of fatigue (“Пом do you feel now: 


о concentrating on sounds 
luce the number of hal- 
seldom allowed myself 
isual image of the sound. 


to those de 


тис 
шо 
feclit 


Mean Millimeters 


very fresh, fresh, as usual, tired, very 
tired ?"). The curves, which represent mean 
values for the three Ss, show the expected 
changes in the direction of increased effort, 
and presence of a tired feeling in the latter 
part of the vigil. S appeared to show а 
“second wind” phenomenon during the sixth 
session which is reflected in the group curve. 
amination of the physiological and per- 
formance data did not confirm this finding. 
Except for a steeper ЕМС gradient for the 
left pronator teres muscle during this ses- 
sion, there were no conspicuous differences. 
The gradient in the right pronator teres was 
quite low, and there was nothing to indicate 
a break in trend in the data for heart rate, 
respiration, palmar conductance, and the 
error scores (DOT and TOT). 

Further Subjective Reports from Proto- 
cols. Sı found it more difficult to stay awake 
during the tracking sessions than at any 
other time. Early in the vigil he stated that 
he was bored and found it more difficult to 
overcome boredom than to stay awake. As 
the vigil continued to the final tracking ses- 


EFFORT 


го а = 6€ 7 E $ 1 
Session Number 


Fic. 7. Mean rise in ratings of effort and feel- 
ing of fatigue. 
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sion, however, his notes indicate that it was 
increasingly difficult to stay awake and that 
he was constantly fighting sleep. Verbatim 
notes from Thursday at 4:15 A.M.: 

Very hard to stay awake during tracking and 
almost impossible to keep eyes shut in between 
trials : It's either open them or sleep. 

This S had the impression that by the eighth 
tracking session he was "getting lost" in the 
second half of the trials. Analysis of the 
error data, however, failed to support this 
subjective impression. 

From his notes, it appeared that S. was 
more conscious of drowsiness when he was 
not occupied (as during lead placement and 
rest periods). When tracking commenced 
his feelings of drowsiness were reduced. 
5.3 notes reflect a strong desire for high 
achievement in this task. He wrote that 
his performance had "deteriorated" during 
Session 5, but analysis of the error scores 
revealed that mean error score for this ses- 
sion was actually lower than the scores had 
been in the previous four sessions. Follow- 
ing Session 8 he wrote: 


It seemed that even though ту thoug 
elsewhere, I was able to track а 
greater or lesser degree, 
conscious of the tone. 


In the last sessions Sz reported “ 


hts were 
automatically to a 
being only minimally 


formance, but analysis of the error data 
revealed that he overestimated the effects 
of sleep loss on his performance. — . 

Except for the reports on hallucinations. 
there was little else of interest in the notes 
from Ss. 


Pain Thresholds 


In each case the mean pain threshold for 
the last day of the vigil was lower than. the 
mean pain threshold for determinations 
made prior to the last day, and {По ШЕ 
sleep after the vigil. The mean threshol 5 
in watts were 360 and 401 for Si e 
ence of 41 significant at the .02 level), 28 
and 290 for S, (difference of 10 not sig- 
nificant), and 245 and 277 for S, (differ- 
ence of 32 significant at the .001 level). 


Ataxiameter Data 


The ataxiameter data in Table 5 show 
that body sway was significantly increase 
in the second half of the vigil in the case 0 
Sı, significantly decreased in the case of n 
and not significantly changed in the = 
of S. The decrease in sway for S: Jr 
significant for one measure (maximal shift 


at the .02 level, and the decrease bY the 


md ut catnaps” other measure (mean shift per 10 sec.) 
uragement regarding his per- barely missed significance at the .05 leve! 
TABLE 5 
CHANGES iN Вору Sway s MEASURED BY THE ATAXIAMETER DURING THE VIGIL А 
| ~ = ج ےے‎ 
Si ¢ 
|. 1 Е Ss 5, 
tæla Чы тт 
st | 2nd | | | dst | 2nd | | Ist | 2nd 
| Half | Half | Half | Half | Halt | Halt 
| of. of | | of of | | | of of 
| Vigil | Vigil | Diffs | p | Vigil | Vigil | Diff, | م‎ | Vigil | Vigil | Diff. b 
— ات ا اپ تد‎ | 
Mean shift/10sec., | | | | | | 
(cm.) 0.57 | 1.30 | 0.73 | <.02| 0.31 | 0.23 |—0.08! пх" 0.22 | 0.26 | 0.04, us 
Mean maximal | | | 
shift/2 min. | | | 
(cm.) 4.93 | 8.91 | 3.98 | <.05 3.30 ' 2.04 —1.26 <.02 2.48 | 2.26 |—0.22| ns 
ااي ووا ا‎ di | ا ا‎ lar 
а Plus differences signify increase 


to second half of vigil, 
# = 2.06, fos = 2.07. 


n " A d at 
, minus differences signify decrease in amount of body sway from fir 
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DISCUSSION 
EEG 

The decrease in amplitude of the alpha 
component of the EEG observed with pro- 
gressive sleep loss in this study is consistent 
with the findings of previous investigators. 
The methods used in quantifying the EEG 
data have, of course, differed from inves- 
tigator to investigator. Tyler et al. (1947) 
used manual methods of analysis and found 
a decrease in the 9-10 cps band. Bjerner 
(1949) also used manual methods but classi- 
fied frequencies of approximately 10 cps as 
alpha activity in the case of three of his Ss 
and 8.5 cps and 7.5 cps, respectively, in 
the case of the other two (pp. 58-59). 
Armington and Mitnick (1959), on the 
other hand, used an automatic digital-type 
method with time as the unit of measure- 
ment. In this case, the frequency band 8-12 
cps was quantified and EEG activity was 
considered to be present only when the 
amplitude of the activity in this band ex- 
ceeded 25 pv. (with 2 Ss, 15 py. was the 
criterion). Despite these obvious differ- 
ences, however, there has been (together 
with the present results) substantial agree- 
ment in the finding that EEG activity in 
the "alpha" band has decreased with pro- 
longed deprivation of sleep. 

Such agreement does not, however, ex- 
tend to the interpretation of these findings. 
Tyler et al. concluded that their results 
were associated with the effort of the Ss 
to stay awake. Djerner, on the other hand, 
concluded quite reasonably that his own 
findings of decreased alpha activity during 
sleep loss were associated with brief periods 
of sleep. Finally, Armington and Mitnick 
interpret their findings of low alpha during 
sleep deprivation as being due to drowsiness. 

Tn previous articles (Malmo, 1958, 1959), 
some of the data reported in the present 
experiment were dealt with in relation to 
activation theory. To summarize the main 
points previously made, depriving an or- 
eanism of sleep appears to be like depriving 
it of other basic needs. Under conditions 
of sufficient environmental stimulation this 
deprivation leads to increased activation. 


That is, judging from the overall picture 


of physiological change during our experi- 
mental vigils, it was concluded that the 
tracking task was sufficiently stimulating 
to interact positively with deprivation to 
produce a significant rise in activation dur- 
ing the 60-hour vigil. 

It is clear that a difficult problem of inter- 
pretation has been raised by these different 
studies. Before returning to a consideration 
of the separate studies, it will be useful to 
recognize one very important general point. 
This is simply that low alpha EEG activity 
can, when it is observed, indicate either a 
high level of alertness (high activation) or 
a drift toward sleep (low activation). This 
to be sure is not a new finding—it has been 
reported in numerous studies. Only re- 
cently, however, have the implications of 
this finding for behavioral studies been 
made explicit (Malmo, 1959; Stennett, 
1957a). We may conclude then that, if 
we consider only the level of alpha EEG 
activity at a given time—and nothing else— 
the question of whether an S shows high 
or low activation is indeterminate. In the 
light of this conclusion it is obvious that 
the apparent issue of interpretation which 
has been raised reduces to a matter of other 
evidence. It revolves around the question: 
Besides level of EEG alpha, what other 
evidence has been brought to bear on the 
matter? Let us examine the various studies 
in turn with reference to such additional 


evidence. 

Although no measures other than the 
EEG were employed by Tyler et al. (1947) 
to assist with the interpretation of the alpha 
findings, these workers did show that ac- 
companying the fall in 9-10 cps activity 
there was a rise in the fast-frequency com- 
ponent of the EEG (1.е., а higher percentage 
of waves in the 11-17 cps band, and in the 
band from 18 cps and higher). This cer- 
tainly is more consistent with the interpreta- 
tion of increasing activation rather than the 
reverse inasmuch as a decrease #ot an in- 
crease in the higher frequencies is to be 
expected with а change in the direction of 
sleep ( Lindsley, 1957, p. 68). 

Bjerner (1949), it will be recalled, did 
not deal with long-term changes in alpha 
but with short-term, transient changes during 
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"delayed actions." His finding of decreased 
alpha was associated with evidence from 
behavioral observations as well as evidence 
from another physiological measure (heart 
rate). The EEG changes he observed were 
accompanied by overt signs of sleep (marked 
delay in response to a signal, snoring) along 
with a reliable fall in heart rate. From this 
additional evidence it seems clear that 
Bjerner’s Ss were at these times drifting 
down the activation dimension. 

It is unfortunate that Armington and 
Mitnick (1959) presented no physiological 
evidence besides the 8-12 cps EEG data; 
but it seems likely that they are correct in 
interpreting their findings of low alpha dur- 
ing sleep deprivation as being due to drowsi- 
ness. In other words, it appears probable 
that they were unsuccessful in their attempt 
to investigate the effects of sleep deprivation 
under conditions less favorable for dozing 
than in previous studies: having their Ss 
stand was certainly no guarantee that the 
Ss would not drift toward sleep. 

To reiterate, in our experiment the EEG 
and other Physiological findings are con- 
sistent with the interpretation that our Ss 
were alert and in a high state of activation. 
We are not aware of any instance where rise 
in palmar conductance or respiration rate 
has been reported to coincide with a drift 
down the activation dimension toward sleep. 

Because so little has been reported about 
the 24 eps component of the EEG in 
waking Ss it is difficult to interpret the 
finding that tracings from S, and S, showed 
a significant fall in amplitude. However, 
two rather general comments may be made. 
Firstly, it is interesting to note that once 
again it was 5, who was the exception. 
Secondly, 5, whose tracking performance 
deteriorated more than the others showed 
the greatest fall in 2-4 cps EEG. If $\% 
drop in performance were interpreted as 
due to drift toward a sleepy state, it is likely 

that, if anything, a rise in amplitude of 
frequencies in this low range would have 
been expected instead of the fall that was 
observed (Malmo, 1959, p. 380). The same 
general argument holds for S, who showed 
a slight drop in performance. S, it will be 
recalled, actually improved in tracking in 


the second half of the vigil so that the 
slight rise in 2-4 cps EEG amplitude pn 
hardly be accounted for in terms of increas 
ing drowsiness in this S. 


Other Physiological Measures 


Heart Rate. Sy and S; showed significant 
rise in heart rate during the vigil. This 
finding was consistent with the і rise 
palmar conductance and in respiration ra d 
The results from Ss, on the other hand, 
were the exact opposite; in his case her 
was falling heart rate which on the basis c 
а comparison between first and last halves 
of the vigil was statistically significant. 

This reversal in heart rate is difficult b» 
interpret considering that the palmar SEG 
ductance, respiration rate, and 8-12 cps EE a 
data were very similar to those of the other 
two Ss (see Table 2). However, we 
note that, in referring to the experiments x 
Hickam and his co-workers, Gellhorn (1953, 
p. 338) indicated that under activating соп" 
ditions which produce heart rate accelera- 
tion in the majority of Ss, some Ss sho" 
diminished heart rate and lowered bloo 
pressure. As a matter of fact, S, did pe 
а fall in blood pressure from the first = 
to the second half of the vigil, and althoug" 
the difference was not large it was sig”! ? 
cant (at the .01 level), Sı, on the othe 
hand, showed a rise in blood pressure from 
the first half to the second half of the vis! 
though this rise was not statistically signi®- 
cant. The attempt to record blood pressure 
from S, was abandoned because finger 
movement artifacts made it impossible tO 
use this instrument (which depended on а 
finger pickup), 

However, it should 
there is evidence ag 
simply 


be made clear that 
ainst the notion that Sz 
reacts with a drop in heart rate 
under all conditions in which the heart 
rates of others increase. Data taken several 
months before his participation in the sleep 
deprivation experiments clearly showed 
typical heart rate acceleration in this 5 
under certain activating conditions. Com- 
parison of heart rate for 5-min. trials durin£ 
the last half of the session with that for th? 
5-min. trials during the first half of ® 
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session showed a significant rise in the sec- 
ond half. 

Finally, it may be noted that even in the 
second half of the vigil despite the down- 
ward shift in the heart rate baseline, heart 
rate rose regularly at the end of each trial, 
during the time that the S was waiting for 
and receiving information concerning his 
performance on that trial. 

EMG. As previously stated each 5 
showed a significantly increased tension in 
one particular muscle (left neck, right bi- 
ceps, and forehead, for S, Sa and. 53, 
respectively), and all Ss showed a progres- 
sive decrease in level and in steepness of 
gradient of EMGs from the left pronator 
teres muscle during the vigil (see Fig. 6). 

The findings of increased muscle tension 
localized in one muscle group has been dis- 
cussed previously in relation to the specificity 
of localized muscle tension in psychiatric 
patients with complaint of discomfort from 
these tense muscles (Malmo, 1959). The 
previous statement was as follows: 

Where high level activation is long continued as 
in a vigil or in certain psychoneurotic patients, it 
appears that skeletal tension may become localized 
to a single muscle group. The discomfort asso- 
ciated with this tension in some patients can become 
extremely severe. It should be noted that in one- 
session experiments, where rise in activation was 
for relatively short intervals of time, tensional 
rise occurred in more than one muscle group. 

Methodologically, these results are important 
because they reveal a difference between EMGs 
and some other physiological measures with respect 
to gauging activation. Unlike heart rate or respira- 
tion rate that invariably yields one measure no 
matter how it is recorded, there are as many 
measures of muscle tension as there are muscles 
that can be recorded from. It appears that when 
sufficient care is taken, EMGs may be very valuable 
in helping to gauge activation, but that considerable 
caution is required in the interpretation of results, 
and especially in the interpretation of negative 
results (p. 383). 

At the very end of the vigil each S was 
given an extra 5-min. trial, after being in- 
formed that this was his very last tracking 
trial. Under these conditions it was inter- 
esting to note that the tension in the left 
eres muscle that had been gradually 
decreasing all through the vigil, returned 
to the level it had reached during the early 
trials and also showed approximately the 


pronator t 


same steepness of gradient. This finding 
suggests that there might have been some- 
thing like a conservation of energy mechan- 
ism operating to reduce the tension in the 
left arm during the vigil, and that when 
the vigil was over at last, and such con- 
servation of energy was no longer required, 
the tension in the left arm returned to its 
previously higher level. A puzzling question 
is why, if such a conservation mechanism 
existed, it did not operate on all of the 
muscles. Of course with a larger group of 
55 and with more channels of EMG it is 
quite possible that variations would be 
found, with some individuals showing rise 
in more than one muscle, and some individ- 

uals perhaps not showing a rise in any 

muscle. However, considering the small 

number of Ss in this investigation, the 

consistency of the findings was remarkable. 


Pain Thresholds 


The three Ss were consistent in the direc- 
tion of change (a drop) in pain threshold 
with sleep deprivation, although S.’s change 
was not statistically significant. These find- 
ings are on the whole consistent with those 
of Cooperman et al. (1934) whose report 
was summarized in the introductory section 
of this paper. Although the negative results 
of Schumacher et al. (1940) are occasion- 
ally cited as evidence against a change in 
pain threshold with sleep deprivation, this 
does not appear to be a fair conclusion 
because the threshold determinations were 
carried out after only 24 hours of sleep 
deprivation. Actually the main purpose of 
the experiments by Schumacher and his co- 
workers was to test the influence of a 
rather moderate amount of sleep deprivation 
on the pain threshold. 

As far as we know, pain is the only 
modality that has been found more sensitive 
under conditions of sleep deprivation. In 
the introductory section we briefly sum- 
marized the findings concerning other sense 
modalities. From this evidence it appears 
that pain is unique with regard to the effect 
sleep deprivation has upon it. This unique- 
ness is of interest in relation to Hebb’s 
(1949) theory of pain. Hebb emphasizes 
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SUMMARY AND CONCLUSIONS 


Three separate 60-hour vigils were held 
with each of three healthy young male S's. 
During each vigil, 10 1-hour recording ses- 
sions were held several hours apart. In each 
session the following measurements were 
recorded continuously while the S was per- 
forming on a tracking task with 5-min. 
trials and 1-min. rests: EEG, EMGs from 
various muscles, respiration, palmar con- 
ductance, and heart rate. 

Main findings were that in all three Ss, 
palmar conductance and respiration showed 
а statistically reliable increase, and 8-12 
cps EEG a reliable decrease in amplitude. 
Heart rate was reliably increased in two of 
the three Ss, (though reliably decreased in 
the third $). In addition, each S showed 
reliably increased muscle tension in only 


FN muscle group (a different one in each 
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On the whole the data strongly supported 
the conclusion that sleep deprivation had the 
effect of increasing level of activation. In 
this respect the present investigation ap- 
peared to be different from certain other 
investigations of sleep deprivation in which 
conditions were not sufficiently alerting to 
prevent the 5s from drifting toward sleep. 
With the overall indication of increasing 
activation from the peripheral measures, 
the fall in 8-12 cps EEG was interpreted 
as also indicative of increasing activation, 
and not as drift toward sleep. га 

Under conditions of this investigation, 
increased muscle tension appeared to be 
localized in one muscle group rather than 
generalized over the entire body. In fact, 
tension in some parts of the body actually 
fell progressively during the vigil. 

It appeared that the special conditions of 
the tracking sessions were particularly con- 
ducive to the appearance of hallucinatory 
experiences. 


| 
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CHANGES DURING SLEEP DEPRIVATION 
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AND 


Tm present study investigates the rela- 
tionships between ability variables and 
progress in learning a complex perceptual- 
motor skill. It represents an extension of 
previous research in which ability patterns 
at early and advanced performance levels in 
psychomotor tasks have been compared 
with a view to establishing the kinds of 
abilities and measures most predictive of 


higher levels of proficiency. These studies’ 


have implications for questions concerning 
the processes involved in learning complex 
skills as well as for problems in training 
and test development. 

The distinction between the terms 
“ability” and "skill" as used here should 
be made explicit. The term ability refers 
to a more general, stable trait of the indi- 
vidual which may facilitate performance in 
a variety of different tasks. Thus, the 


‚ ability of spatial-visualizjtion may be im- 
. portant in such diverse activities as naviga- 


tion, dentistry, and engineering. The term 
skill is more specific: it is task oriented. 
Flying an airplane is a skill, while manual 
dexterity and spatial-visualization are more 
general abilities. Of course, abilities them- 
selves are often products of earlier learn- 


.''This study was initiated in 1956 under Air 
Force Contract AF 41(657)-64 between Psy- 
т оса Research Associates, Inc. and Ат 
Ferce Personnel and Training Research Center. 
In 1958 cognizance of this contract was trans- 
ferred to the Aero Medical Laboratory, Wright 
Air Development Center. Edwin A. Fleishman 
served as Project Monitor while a member of the 
Skill Components Research Laboratory of Air 
Force Personnel and Training Research Center. 
He retained his association with the project while 
on the faculty of Yale University. Robert A. 
Seibel, Bryce O. Hartman, and Marty R. Rock- 
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ing. This view is elaborated elsewhere 
(Gagné & Fleishman, 1959). 

Tt has been found that for certain classes 
of perceptual-motor tasks, there are pro- 
gressive qualitative and quantitative 
changes in those abilities which contribute 
to proficiency as the task is learned. This 
has been demonstrated with a serial ге- 
action task (Fleishman & Hempel, 1954a), 
a visual discrimination reaction task 
(Fleishman & Hempel, 1955), and a con- 
tinuous pursuit task (Fleishman, 1956). 
Woodrow (1938) has found the same pro- 
gressive changes to be involved in extended 
performance on printed tests. In other 
words, for those tasks proficiency at later 
stages of learning depended upon different 
combinations of abilities than did initial 
proficiency. 

The present investigation differs from 
others in this area in two major respects. 
First, the task is considerably more com- 
plex than that used in previous studies. In 
the present instance the task roughly simu- 
lated that performed by a pilot during the 
attack phase of an airborne radar intercept 
mission. Thus, the task was more "real- 
istic" and more demanding than the labora- 


way served as project monitors during the latter 
stages of the study. 

The correlation analysis and factor analysis 
procedures were accomplished under terms of a 
subcontract with Psychological Research Service, 
Inc, of Austin, Texas. They were directed by 
Benjamin Fruchter. 

The tracking devices and scoring consoles were 
designed by Henry P. Birmingham of the Naval 
Research Laboratory. 

Permission is granted for reproduction, transla- 
tion, publication, use, and disposal in whole or in 
part by or for the United States Government, 
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tory tests previously investigated. The 
second difference involved the total amount 
oi learning time which was investigated. 
A limitation of previous studies has been 
the relatively short period required to 
achieve a high level of proficiency. In the 
present study subjects were trained Гог 17 
sessions distributed over 1 to 1.5 months. 

The increased complexity of the present 
task raises the question of whether new 
kinds of abilities are required and whether 
the patterning of these abilities is different 
from that found with previous tasks. For 
example, the introduction of complexity of 
function may demand certain skills of in- 
tegration and coordination not previously 
required. 

A primary purpose of this study, as 
stated above, was to specify the abilities 
underlying terminal proficiency on a com- 
plex control task and to identify the man- 
ner in which the contribution of these 
abilities to proficiency changes during the 
course of learning. Another important ob- 
jective was to compare the predictability 
of terminal performance from external 
measures with predictions from measures 
taken earlier in practice on the task itself. 
In a previous study (Fleishman & Hempel. 
1954a) it was found that advanced per- 
formance on a psychomotor task, the Com- 
plex Coordination Test, was better pre- 
dicted from a combination of printed and 
apparatus tests than from early scores 
taken on the task itself. This study рге- 
sented an opportunity to test the generality 


of this finding in a more complex task situ- 
ation, 


PROCEDURES 


Data Gathering 


The experimental procedures followed in this 
study are similar to those used in previous studies 
in this series. Subjects were first administered 
а large reference battery of printed and psycho- 
motor apparatus tests. Following this the same 
subjects spent a number of sessions m 


astering the 
criterion. task. 


Performance measures were taken 
at the various stages of practice on this task. 
Correlations among scores on both the reference 
battery and the stages of learning of the criterion 
task were obtained and factor techni 


3 \ ques applied. 
The factors were defined from the 10: 


adings of the 
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reference tests. Loadings of the various stages гч 
practice of the criterion task on the factors m 
fined by the reference tests specify the changes ыы 
the factor pattern of this task as practice со 
tinues. 


Subjects 


The subjects were 203 freshman and О 
Ат Force ROTC students at the University oe 
Maryland. No subject had had pilot training © 
any kind. ch subject was paid for his partici- 
pation in tliis study. 


Reference Dattery 


The initial testing consisted of the administ С 
tion of a large battery of psychomotor anna 
and printed tests. Initially a listing was made © 
those factors which might be of importance a 
mastering a complex tracking task. This list ever 
included such abilities as "Verbal Comprehension, 
inasm as this ability might be of importance Е 
understanding the explanation of the requireme! * 
of the task and thus be involved in Vp id 
ultimate proficiency. The majority of the rue : 
ence tests were selected in terms of their ен 
loadings on these abilities їп previous ines 
tions. A complete listing of the hypothesize". 
ability factors and the loadings of the various 
tests on these factors is presented in Appendix А: 
The test battery consisted of 23 apparatus and d 
printed te Fifty different scores were РГО 
vided by this battery. 


Total time for the administration of this taper 
was 11 hours, This was divided into three pipe 
sessions for the administration of the printed tests 
and five I-hour sessions for the administration OF 
the apparatus tests. Two models of cach аррагані 
test allowed for the testing of subjects in palts. 
A complete listing of the tests comprising the 
reference battery 15 presented in Table 1. This 
table also. presents reliability coefficients. for cach 
test in the battery, Appendix D presents drawings 


of the apparatus tests and descriptions of all tests 
both printed and apparatus, 


Criterion Task 


Subject Task. The criterion t 
а tracing device constructed so 
roughly the display 
requirements of an ai 
sion. 


ask consisted ol 
as to simulate 
characteristics and control 


rborne radar intercept. mis- 
The task of the subject was to maintain the 
target dot at the center of the oscillograph display: 
while at the same time nulling a sideslip indicator 
That is, the subject envisoned himself to be flying 
the attack phase of an airhorne radar interce?! 
mission. Thus, if the target was to the right, th 
subject made appropriate control movements 

steer the craft to the right. These movemen 
would bring him "on target” and the dot wor g 
return to the center of the display, All tur?" 
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TABLE 1 


PRINTED AND APPARATUS TEsTS COMPRISING REFERENCE BATTERY 


Test Test Reliability? 
Printed 
1. Aerial Orientation .84 
2. Complex Movements .85 
3. Coordinate Movements .70 
4. Directional Control .85 
5. Discrimination Reaction Time “87 
6. Following Directions .15 
7. Forced Landings 88—01 
8. Formation Visualization 76 
9. General Mechanics “88 
10. Instrument Comprehension “92 
11. Mechanical Comprehension “32 
12. Pattern Comprehension 776 
13. Planning a Course E 
14. Signal Interpretation E 
15. Spatial Orientation .69 
16. Speed of Identification ‚74 
17. Stick and Rudder Orientation .7А 
18. Verbal Comprehension (Vocabulary) .90 
(50) Verbal Comprehension (Background for Current Affairs) .87 
19. Visual Pursuit .80 
20. Visualization of Maneuvers .87 
Apparatus 
21. O'Connor Finger Dexterity .16 
22. Purdue Pegboard (Summation Score) .905 
23. Minnesota Rate of Manipulation (Placing) .87 
24. Minnesota Rate of Manipulation (Turning) .19 
25. Two-Plate Tapping “99 
26. Precision Steadiness (Errors) .90 
27. Direction Control (Corrects) .90-.95 
28. Control Sensitivity ‚94° 
29. Ten Target Aiming (Errors) .94 
30. Ten Target Aiming (Corrects) .91 
31. Bimanual Matching . 145 
32. Track Tracing (Errors) .91 
33. Two-Hand Coordination .89-.93^ 
34. Choice Reaction Time .68° 
35. Complex Coordinator .85-.95 
36. Pursuit Confusion (Errors) .90 
37. Pursuit Confusion (Corrects) .88 
38. Motor Judgment 56° 
39. Discrimination Reaction Time .81-.96 
40. Rotary Pursuit 92-.97 
41. Visual Reaction Time .86 
42. Auditory Reaction Time .85 
43. Jump Visual Reaction Time .88 
44. Jump Auditory Reaction Time .88 
45. Steadiness Aiming 84 
46. Single-Dimension Pursuit .84-.92 
47. Rate Control .89 
48. Rotary Aiming .96 
49, Rudder Control .90-.92 


» Reliabilities for those tests not computed directly from data of this study are taken from the following: Fleishman (1958b) 
Guilford and Lacey (1947), Roff (1951). 

b Based on average reliability of the four ви 

« Computed using data of this study (split-half). 


bparts of this test and corrected for an increase in length by a factor of four. 
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movements required coordinated action of stick 
and rudder controls, 

Three identical tracking devices were con- 
structed especially for purposes of this study and 
are shown in Figures 1 and 2. A block diagram? 
is presented in Appendix C. These devices and 
related scoring consoles allowed for the testing of 
from one to three subjects simultaneously under 
the control of a single test administrator. 


? Complete schematics of all components are 


Presented in Wright Air Development Center Re- 
port TR 59-255. 


A Photograph of tracking facility show- 
ing three tracking devices, computer programer 
and scoring console, | 


Fic. 2. Photograph of tracking device, 


Each subject was given a brief indoctrination 
into the nature of the task and the action of the 
control system. He was then allowed 5 to 10 
seconds of practice. Following this, through the 
remainder of the training program, there was no 
formal guidance of any sort. Each subject learned 
the task оп his own. However, all questions of 
subjects related to the operation of the. stem 
were answered. Each subject was told his inte- 
grated error score following every trial. 


Displays. "The subjects instrument panel con- 
tained two displays. The first consisted ofa m 
get dot presented on a cathode ray oscillograph. 
The target course was generated by setting the 
equation of a swinging pendulum into an analog 
computer. This produced a sine wave with 9 
frequency ої approximately 6 cycles per minute 
in the horizontal coordinate as the target enge 
The rate of decay in amplitude was approximate y 
555 per cycle. However, the dynamic character 
tics of the over-all task were such that Sub ee 
neither perceived this drop in amplitude nor ph 
fact that the target was programed in one ee 
ordinate only. Any control imbalances resulted 18 
dot excursions of considerably larger magnitut 
than those provided by target programing. 3. 

Beneath the oscillograph was an inverted lip 
inch, zero-centered voltmeter termed a pear 
indicator." This meter reading indicated to НЕ 
subject a "lack of coordination" in control action 
when centering the target dot. "This indicator ex 
not constitute an independent task but rather pro" 
vided the subject with additional information FÊ 
lated to the primary task, " 
, Control System, In performing this task d 
Jects used a standard aircraft control system " 
volving a control stick and rudder pedals. De 
controls were coupled in a manner similar to ors 
Of an actual aircraft, Thus, application of ^ 
control stick pressure without proper amount 
right rudder produced a sideslip to the left a“ ы 
consequent left deflection on the sideslip indicator 

Control of the target dot in elevation was n 
complished by forward and backward movemen" 


^ гаег 
of the control Stick. This was a pure secon -orde 


E Е n- 
System resulting from the use of two cascaded 1 
tegrators 


. „їп the linkage between the control a 
display. Thus the acceleration of the target 
was directly Proportional to stick displacement. 
pied of the target dot in azimuth was accom 
mew right and left movements of the ie eom 
tion гол COmprised a system involving acce This 
9n control plus ап exponential lag network. Е 
dimension was mechanized using three cascade 
integrators with a negative feedback loop aroun 


one. The time constant of this lag network 15 = 
second; that j achiey 


Ы , Is, it requires 1 second to à 
approximately two-thirds (1-1/е) of the fm 
signal 


Па! resulting from a given stick @іѕріасетеп 

Control of the target dot in azimuth (envisone’ 
as turning the aircraft) and centering the pss 
Slip indicator (coordination display) were gs 
eflected by rudder pedal displacement as well * 


PREDICTORS OF COMPLEX TRACKING BEHAVIOR 


by the control stick. This rudder control of the 
sideslip indicator involves a simple lag network. 
Thus, the sideslip indicator displacement was di- 
rectly proportional to rudder pedal displacement. 
Movement of the sideslip indicator by stick action 
represents a velocity control operating through 
two exponential lag networks. 

Movement of the target dot by rudder pedal 
displacement approximates a pure velocity control. 

Criterion Scores. The following five perform- 
ance measures were obtained during each testing 
session, Four of these were error scores; one was 
a time-on-target score. The error scores represent 
tlie extent of target dot (or indicator needle) dis- 
placement. from the null position in one dimension 
summated through time. 

1. Integrated absolute error score. This was 
recorded at the conclusion of every trial and was 
produced by summing algebraically the three 
absolute error part scores described below in ac- 
cordance with this relationship: 


T 12K + 1/2Y-42 [1] 


T = Integrated absolute error score 
Х = Absolute azimuth error 

У = Absolute elevation error 

Z =Sideslip error 

The double weight for the sideslip error was 
introduced on the basis of pretest results in which 
it was noted that some subjects neglected rudder 
pedals altogether and used only the stick to con- 
trol the dot. The increased weight made a good 
total score more dependent upon proper rudder 
pedal use. 

Each of the following part scores was recorded 
every third trial. An automatic sequencing 
mechanism switched from one part score to the 
next at the conclusion of each trial, 

2, Azimuth (yaw) part score. This is a summa- 
tion of the total error occurring in the horizontal 
is of the oscillograph display. 

3. Elevation (pitch) part score. This is a sum- 
mation of the total error occurring in the vertical 
axis of the display, 


Where 


ay 


p 


This is 
between 


4. Sideslip error part score. 
of the lack of coordination 
rudder movements. 

5. Time-on-target score. This score represents 
the time during the trial when the target dot 
and the sideslip indicator needle are both (simul- 
taneously) within prescribed tolerance limits. 

The time-on-target area consisted of a diamond- 
shaped area (a tilted square) in the center of the 
oscillograph. This area was not demarked for 
benefit of the subject. Distance from the center 
of the oscillograph to any corner of the square 
was 1 inch. Sideslip error also entered into the 
calculation of time-on-target. Sideslip tolerance 

as “= 0.5-inch needle displacement from cen- 
Thus, if a subject approached the tolerance 


a measure 
stick and 


ter. 
limit in the X axis of the scope display, a slight 
deflection of the sideslip indicator would throw 
him out of tolerance. 


Data Analysis 
Reference Test Scores 


From each test, the obtained distributions of 
raw scores were transformed to normalized distri- 
butions of standard scores (stanines), each with 
a range from 1 to 9, a mean of 5, and a stand- 
ard deviation of 2. Conversions were made so 
that the 9 end of the scale was always indica- 
tive of good performance, 


Tracking Performance 


ch of the 17 practice sessions for the criterion 
tracking task consisted of 21 1-minute trials, 
making a total of 357 trials. 

The problem of deciding on the size of the 
time segment to be sampled from these trials in- 
volved the following considerations. It was neces- 
sary to have each segment large enough to be 
sufficiently reliable; at the same time, it was de- 
sired to keep the segment small enough to provide 


TABLE 2 


RELIABILITIES OF THE 


VARIOUS MEASURES OF TRACKING PERFORMANCE АТ THREE STAGES 
DURING THE LEARNI 


чє PERIOD 


Integrated Horizontal Vertical Sideslip Time-on- 
Error Error Error Error Target 
Early Stage .83 .64 -60 .63 .58 
(Segment 1-Segment 2) 
Intermediate Stage .87 .67 .68 .87 .78 
(Segment 25-Segment 26) 
89 .56 -67 -96 .89 


Final Stage 
"(Segment 50-Segment 51) 


Note. АП correlations based on 203 cases. 
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as precise as possible a definition of the learning 
curve. Thus, use of an entire session as a segment 
might be highly reliable but changes occurring 
during the session might be obscured. 


For each session, scores of the first three trials 
were discarded in order to avoid problems of 
warm-up effect. The remaining 18 trials of each 
Session were then subdivided into three groups of 
six trials each. Each group of six trials was 
designated "time segment" of performance. There 
were thus 51 time segments obtained through the 
full period of practice. For each of the over-all 
measures of performance (integrated error score 
and time-on-target score) one time segment repre- 
sents 6 minutes of performance. For each of the 
part scores (elevation, azimuth, and sideslip 
error), a time segment represents 2 minutes of 
performance, a part score being recorded on every 
third trial only. Table 2 Presents reliability co- 
efficients for each measure of performance at an 
initial, intermediate, and terminal stage of learn- 
ing. These coefficients represent the correlation of 
adjacent time segments, 


Learning Curves. Figures 3 through 7 present 


This error curve shows 
about Segment 15. This 
any subjects did not use 
У great extent during in- 
It appears that although 
requirements of this task, 


Curve differs from the other: 
moidal rather than exponential, 


the time-on-target represents 


Performance а 


а poor measure of 
at the extremes 


when the subject 
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Fic. 7. Performance curve based on sideslip 
(coordination) error measure. 


Figures 3 through 7 illustrate group perform- 
ance and present relatively stable curves. There 
was, however, considerable variation among in- 
dividuals with respect to performance on this task. 
Figure 8 presents performance curves based on 
integrated error score for three subjects. These 
curves were selected to illustrate the extent of in- 
dividual differences both in rate of learning and 
in terminal proficiency. Subject A, for example, 
is the poorest initially but rapidly improves to a 
high level of proficiency. Subject B improves 
slowly and erratically throughout the period of 
practice, although initially he is better than Sub- 
ject A. Subject C shows relatively little im- 
provement at any time. 


Pu SUBJECT А —. 
Из SUBJECT В — 0 
8 2500 SUBJECT C, —X 


INTEGRATED ERROR Sí 
a 
o 
o 


о 5 10 15 20 25 30 35 40 45 50 
TIME SEGMENTS OF PRACTICE 


Fic. 8. Curves of three selected subjects illus- 
trating the extent of individual variation in per- 
formance. 


Selection of Practice Stages for Factor 
Analysis. It was decided that 10 stages of prac- 
tice would be used in the subsequent factor 
analysis as а basis for determining the changes 
in underlying ability patterns during the learning 
of this task. The selection of the appropriate 10 
stages of practice was accomplished before it was 
possible to compute the mean scores for the entire 


sample of 203 subjects. For this purpose data 
from a random sample of 25 subjects were chosen, 
and a curve using integrated error score as a 
measure of performance was generated. Figure 9 
presents this curve and indicates the 10 stages of 
practice which were selected for inclusion in the 
final data analyses. Points were selected in a man- 
ner it was felt would maximally define critical 
periods during the learning process. All perform- 
ance measures at each of these 10 practice stages 
next were converted into stanine form for fur- 
ther analyses. 


INTEGRATED ERROR SCORE 


123 4 5 6 7 8 9 10 и I2, 13 14 15 I6 
30 35 40 45 50 


о 5 ю 15 20 25 
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Fic. 9. Curve based on integrated error score 
of 25 randomly selected subjects showing selection 
of 10 stages of performance. 


Correlational Analyses 


Following the transformation of all refer- 
ence test scores and criterion measures into 
stanine form, subsequent data analyses were ac- 
complished through use of the ORDVAC high- 
speed digital computer. Appendix D presents the 
matrix of intercorrelations among all printed and 
apparatus tests of the reference battery. Appen- 
dix E presents intercorrelations between reference 
tests and performance measures of the tracking 
task. Appendix F presents intercorrelations be- 
tween and within total scores and component 
scores taken from the task itself. 

The matrix of intercorrelations of reference 
tests was factor analyzed using a digital computer 
program yielding a principal axes solution. 
Fifteen factors were extracted by this method. 
Appendix G presents the unrotated factor load- 
ings for tests in the reference battery. Orthogonal 
rotations of the reference axes were made, using 
the graphical method of Zimmerman (1946). The 
correlations of performance scores (integrated 
absolute error) with the reference test scores next 
were used in projecting performance measures 
onto the rotated axes defined by the reference bat- 
tery. The procedure used was Mosier's extension 
method as described by Fruchter (1954). Appendix 
H presents the rotated factor loadings of all 
reference test scores and measures of perform- 
ance. 
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are involved, primarily (Fleishman, 1953, 
1954; Fleishman & Hempel, 1955; Hempel 
& Fleishman, 1954), 


No. Variable Loading 
21 O'Connor Finger Dexterity .49 
22 Purdue Pegboard .43 
23 Rate of Manipulation .36 
Factor XV—Multilinb Coordination. 


This factor is common to tasks in which 
the requirement is the simultaneous use of 
both hands, both feet, or hands and feet. 
These tests have defined this factor in pre- 
vious studies (Fleishman, 1958b; Fleishman 
& Hempel, 1956). 


No. Variable Loading 
49 Rudder Control -40 
35 Complex Coordination Test .38 
33 Two-Hand Coordination -30 


Relation of Ability 


on | Factors to Stages of 
Practice in Tracking 


. This study was concerned with determin- 
ing the kinds of basic psychomotor abilities 
underlying tracking Proficiency and with 
the manner in which these abilities change 
аз а function of practice, Figure 10 illus- 
trates the extent to 
abilities account for 
ing performance at 
of learning. It will 
in the terminal stages 
account for as much 
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Fic, 10. Percentage of variance 

performance accounted for by ail 
tors. 


e in tracking 
identified fac- 


variance. [igure 10 clearly indicates that 
the set of abilities defined by the refer- 
ence battery in the present study does not 
account for any substantial portion of the 
variance in tracking performance. This is 
in contrast to previous studies with other 
psychomotor tasks. Гог example, т the 
case of the Complex Coordination Test up 
to 7506 of the variance was accounted for 
by independent reference ability measures 
(Fleishman & Hempel, 1954a) ; over 50% 
was accounted for in the Discrimination 
Reaction Time Test (Fleishman & Hempel, 
1955); roughly 609% in the two-hand ae 
unidimensional matching and plane contro 
tasks (Fleishman, 1957а). | hé 
Consideration next was given to a 
nature of the changes occurring during 
practice in those abilities which do show 
some relation to tracking performance. 
Based on integrated error score as 2 
measure of performance, only Factors 
(Spatial Orientation), УП (Response 
Orientation), XII (Pursuit Confusion); 
and XV (Multilimb Coordination ) had 
factor loadings in excess of .20 at any stage 
of practice. Of these, only Spatial Orienta- 
tion and Multilimb Coordination show pi 
tematic changes in importance during em 
course of practice. Figure 11 present? 
curves illustrating changes in the рег 
centage of variance accounted for by these 
two abilities with increased practice. Multi- 
limb Coordination shows an increase 1 
importance during the course of learning 
while Spatial Orientation appears to be 
important only during the middle stages. 
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Consideration was given to the possibility 
that use of only 6 minutes of performance 
in the integrated error criterion measure 
might not have been a sufficient behavior 
sample and might have lowered the corre- 
lations obtained between performance and 
reference tests. Certainly use of an ex- 
panded measure of performance would in- 
crease the reliability of this measure even 
though it had previously been found to be 
sufficiently high (.89 at the terminal prac- 
tice stage). In order to examine this possi- 
bility, several correlations between inte- 
grated error and three of the reference 
tests at the three stages of practice were 
recomputed, using a 21-minute perform- 
ance measure rather than a 6-minute 
measure. Table 3 presents both sets of 
correlations. It does not appear from the 
changes in the correlation coefficients that 
any significant. increase would have been 
obtained by using an expanded measure of 


performance. 


Relationships among Component Perform- 
ance Measures 

It will be recalled that, at each of the 
practice stages, five measures of perform- 
ance were obtained. These measures were: 

1. Integrated absolute error 

2. Azimuth (X) error 

3. Elevation (Y) error 

4. Sideslip (Z) error 


5. Time-on-target 
simultaneously ) 
The integrated absolute error score is not 
an independent measure but is determined 
by the X, Y, and Z scores in accordance 
with the relationship described previously 
in Equation 1. 

The above performance measures were 
obtained primarily with a view to answer- 
ing the question: "For this type of task, 
how well do component performance 
measures predict terminal proficiency?” 
Table 4 presents the intercorrelations of all 
performance measures at an initial point in 
learning (first time segment) and at an 
intermediate point (twenty-fifth time seg- 
ment). The table also presents the corre- 
lations of all measures, at each of these 
stages, with terminal proficiency (fiftieth 
time segment) as measured by the inte- 
grated error score. It is apparent that in- 
itially all component measures are uncorre- 
lated with terminal proficiency. In other 
words, neither initial component measures 
nor over-all measures allow the prediction 
of final performance in this task. At an 
intermediate point in the learning period, 
component measures do show some rela- 
tion to the final score (X — 28, У = .36, 
Z = 34) but none is sufficiently high to 
serve as a very useful predictor. 


(scope and sideslip 


3]It will be recalled that the full course of 
practice was broken into 51 time segments. 


TABLE 3 


CORRELATIONS BETWEEN SELECTED REFERENCE TESTS AND A MEASURE OF PERFORMANCE 
ILLUSTRATING THE EFFECT OF EXPANDING THE MEASURE OF PERFORMANCE 


Correlations 
Previous* Expanded 

Aerial Orientation—Session 5 +25 .25 
Discrimination Reaction Time (Printed)—Session 5 .22 .25 
Complex Coordinator—Session 5 wel 24 
Aerial Orientation—Session 9 7 .14 
Discrimination Reaction Time (Printed)—Session 9 .18 aL 
Complex Coordinator—Session 9 ik] .14 
Aerial Orientation—Session 17 .18 .09 
Discrimination Reaction Time (Printed)—Session 17 .09 .12 

.13 .14 


Complex Coordinator—Session 17 


a The performance measure (integl 
in each case forms a 


rated absolute error) entering this correlation is a 6-trial “time segment." This time segment 
part of the 21-trial session which represents the expanded measure. 
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TABLE 4 


CE MEASURES AND A FINAL 
INTERCORRELATIONS AMONG COMPONENT PERFORMANCE MEASURES AND A Ет 
PROFICIENCY MEASURE 


Final 
3 1 
— бр JE Time-on- Integrated 
пава Х Еггог Ү Еггог Z Error Target Error 
rror - T em 
TIME SEGMENT 1 04 
5 © «35 * 
Integrated Error 25 * " EA 4n .02 
A .3 = = 08 
Y Enor -16 в, 1? 
Z Error —.02 
Time-on-Target 
TIME SEGMENT 25 48 
: .65 EE -63 92 128 
a. еы “68 -02 б 36 
Y Error .27 К Em 
Z Error .48 
Time-on-Target 


* The fact that time-on-target 


4 Ret scores are noi 
Scores to stanine form. At this ti 


me all score distributions were 


Discussion 


The rationale underlying this study 


is 
that there exists 


within the population 
under inquiry a finite number of abilities 


which are used in mastering a complex 
tracking task. These abilities are represent- 
ative of the class of theoretical construct 
which is Tesponse inferred, They are re- 


Sponse inferred jn that, although {һе 
characteristics of the task remain un- 
changed, the performance of individuals 
exhibits 


major variation, 
this study was to | 
terms of relations among the various per- 
formance measures (reference battery). 
Variability їп Performance at different 
Stages of mastering the tracking task then 
could be accounted for in terms of the 
identified abilities. The validity of such 
ability identification is attested to by the 
factorial invariance found in previous 
studies using different reference tests (see 
Appendix A). The Present study confirms 
this invariance using different subjects 
a different combination of tests, 


If tracking perfor 


and 


mance in fact can he 
accounted for in terms of a limited number 


of measurable human abilities, the results 


will have implication for such areas as 


t negatively correlated with the error score: 


ing raw 
5 is due to the process of converting 


АЕ ormance. 
arranged so that high score was indicative of good perfi 


human engineering design of equipmem 
personnel measurement and nee 
the identification of an efficient set of ane 
ing principles designed to improve tram 
in tracking procedures. soe the 

Initially it might be well to eee 
rationale behind this study and the d 
in which this study relates to the 1961) 
area of tracking research. Adams ( kinds 
indicates that there exist two major pura 
of tracking research. Engineering ded 
chologists have used the theory of "x for 
loop servomechanisms as a model “king 
investigating the p шай? 
system. Adams points out the ас “ies 
tages and disadvantages of this О! rch 
tation and notes that it results in resea 5- 
oriented toward task variables. Thus. im 
tem input-output relations are dE 
terms of time and frequency character re- 
and are described through output реа 
sulting as a function of changes in jp 
parameters. Such changes might m ar 
formed on the input signal, control Sy pee 
dynamics, or the control-error display 
lationships, 

General experimental psychology, О! 
other hand, has used traditional beh: ys- 
theory as its model and the tracking 5 


man-machine 


1 the 
avio" 
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tem becomes a means of eliciting desired 
behavior classes. Such research is oriented 
toward procedural variables related to 
training, retention, stress, fatigue, motiva- 
tion, etc. The major deficiency in this type 
of research is that apparently any task is 
considered appropriate if it elicits the de- 
sired response class. Little consideration is 
given to possible interactions between task 
and procedural variables. 

The present investigation appears to fall 
somewhere between these two types of re- 
search. Tt is concerned with task variables 
to the extent of insuring a complex 
criterion task and defining the dimensions 
of complexity. These are a two-dimen- 
sional display system, control character- 
istics varying from zero-order to second- 
order, and requirement for dual-error 
nulling. Task variables were not used as 
independent variables, however. This study 
was not concerned with procedural vari- 
ables as such, yet the results have implica- 
tions for such areas as learning theory and 
training. 


Identification of Perceplual-Motor Abilities 


One major end product of this study is 
found in the identification of the ability 
factors measured by the tests within the 
reference battery. Forty-three printed and 
apparatus tests were administered to these 
subjects. (Fifty scores appeared due to 
use of subparts and correct and error 
scores as separate measures of perform- 
ance.) Although the majority of these tests 
were chosen on the basis of their factor 
loadings in previous investigations, a num- 
ber of new tests were introduced. Results 
of the factor analysis of these test intercor- 
relations serve further to define the basic 
components of human perceptual-motor 
abilities. 


Changes in Ability Patterns with Practice 


The general finding in investigations of 
this type (Fleishman, 1956, 1959) is that 
< in the ability pattern underlying 


changes у p т 
»roficiency in the criterion task are quite 
J с ive during the learn- 


svstematic and progressive с earn 
ing period. Гог the tracking task of this 
s 
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study, however, Figure 11 indicates that 
only Factor I (Spatial Ọrientation) and 
Factor XV  (Multilimb Coordination) 
show a systematic relation to tracking per- 
formance during the course of practice. 
Apparently, early in practice the ability to 
interpret the visual display characteristics 
is of some importance. Later the ability to 
balance and to coordinate the activity of 
hands and feet in manipulating the control 
stick and rudder pedals plays some part in 
determining the proficiency of the subject. 
This is in keeping with the observation 
during the testing program that most sub- 
jects experienced extreme difficulty in 
effecting coordinated movements during 
the early part of the training program. 

The Multilimb Coordination factor be- 
gins its major increase in importance at 
approximately the fifteenth time segment 
(Figure 11). This is roughly the same 
period when the sideslip (Z) error score, 
which reflects coordinated hand-foot 
activity, begins to show considerable im- 
provement (Figure 7). There are thus two 
independent analyses in support of each 
other. An identified ability factor and a 
measure of component task activity both 
indicate the importance of coordination be- 
tween limbs in determining proficiency at 
a given point in the practice schedule. 


Predictability of Tracking Performance 


One basic problem for consideration with 
any complex activity is whether terminal 
proficiency is better predicted by a set of 
external measures or by scores taken from 
initial practice periods in the activity itself. 
With the present tracking task there is no 
correlation between initial tracking per- 
formance and terminal performance. (The 
correlation between first-segment integrated 
error scores and fiftieth, or final, integrated 
error scores is .04). Based on the first time 
segment intercorrelations shown in Table 
4, a multiple correlation analysis was per- 
formed. Results of this analy indicate 
the extent to which terminal performance 
is predictable through use of the five per- 
formance measures taken during the initial 
practice session. The obtained multiple cor- 
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relation coefficient is .17, which is not sta- 
tistically significant. In other words, scores 
obtained during initial practice periods in 
the activity itself do not predict terminal 
proficiency. It is interesting to note that 
even at the twenty-fifth time segment, 
when the subject has had approximately 4 
hours of practice in the device, the corre- 
lation of integrated error score with term- 
inal proficiency (terminal integrated error 
Score) has risen only to .46. 


In opposition to the above findings there 
are six of the reference tests which show 
correlations with final tracking perform- 
ance of .26 or higher. The intercorrela- 
tions among these tests are presented in 
Table 5. The obtained multiple correla- 
tion using these tests as predictors of term- 
inal proficiency is 43, which is significant 
at the .01 level. 

These results indicate that early com- 
ponent and total scores taken from the task 
itself do not predict advanced proficiency 
levels until an intermediate stage of prac- 
tice, and these are not high. If one must 
Choose between intratask and independent 
measures, selected independent measures 


will give а better prediction of advanced 
tracking proficiencv, 


Variable 1 36 37 


39 40 K 
RENÉ Nd 

T 5/25 29 37 
. M. 
11 -27 19 а 1 ^ 
36 48 .26 з 56 
2 31 .31 196 
а 29 .30 
.27 
UE... WAR = 


a These numbers correspond to th 


р м d е designation of th 

ing variables during the machine data anspor?! he follow. 
£ Var. 7 = Forced Landings analyses: 

Var. 11 = Mechanical Comp, 

Var. 36 = Pursuit Conf, fe) 

Var. e = Бш Conf. Com) 

Var. 39 — Discrimination Reaction Tin 

Var. 40 — Rotary Pursuit ne (Арраг.) 

Var. К = Segment 50 (Terminal) Integrated Ег; 


* Significant at the .01 level (N = 203). тог Score 


Human Performance in Tracking 


TThe over-all operation of tracking a 
moving target can be examined in terms of 
component responses required of the 
human being. Adams (1961) has classified 
the component responses in tracking be- 
havior into observing, predicting, and 
motor responses. These are defined below 
and are followed by a listing of those fac- 
tors identified in this study which should 
be important to each response. In some in- 
stances certain factors appear to emphasize 
elements of more than one response class. 


Observing Response. The sensing of 
play parameters including both input Signe 
and error signal is by the observing “a 
sponse. The observing response appears o 
involve head and/or eye movements lly 
direct the visual receptors to spatially 
separated stimuli, and the discrimination Ө, 
stimulus change. The following identi da 
factors would seem to be important for m 
observing response: 


Factor I. Spatial Orientation 
Factor ІХ. Perceptual Speed 
Factor X, Visualization 


Prediction Response. The prediction н 
ponse involves predicting the regule, 
inherent in the input signal and the e. 
of control movements upon the error E 
nal. This implies that the subject w 
learned the dynamics of the control syste 
and the interactions between control 0 
ments and display changes. o 
(1957) refers to such prediction as Pf 
ceptual anticipation, Perceptual anticipa- 
tion involves prediction of nature and AE 
of muscular contractions required, рге = 
tion of duration of response movement, anc 
Prediction of future position of target е 
time of completion of response movemen 
For the criterion task of this study the fo 2 
lowing factors would appear to be iM 


portant: 
Factor УП. Reg 1 i 
a ponse Orientation 
Factor X. Visualization 
actor ХІ, Integration х 
The Rate Control factor, which was E 


inally hypothesized, would fit in this =. 
gory. However, this factor was not iden 
fied in the Present study, 
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Motor Response. The motor responses 
are those activities involved in control sys- 
tem (stick and rudder) manipulations. It 
appears that the following abilities identi- 
fied within this study would be of impor- 
tance to such motor responses: 


Control Precision 

Speed of Arm Movement 
Manual Dexterity 
Response Orientation 
Arm-Hand Steadiness 
Integration 

Pursuit Confusion 
Finger Dexterity 
Multilimb Coordination 


Factor П. 


Factor IV. 

Factor VII. 
Factor VIII. 
Factor XI. 

Factor XIT. 
Factor XIV. 
Factor XV. 


Identification of Abilities Underlying 
Tracking Proficiency 

The preceding analysis would indicate 
that a successful tracking response is de- 
pendent upon a number of component re- 
sponses. These component responses 
would, in turn, involve certain basic “abili- 
ties." The major objective of this study 
was to determine the extent to which a set 
of abilities as identified through use of the 
reference battery of tests accounts for 
variability in tracking performance at vari- 
ous stages in the practice schedule. The 
possibility existed that all of the factors 
listed in the preceding section would be 
important. Аз it turned out, this was not 
the case. As seen in Figure 10, at no stage 
of practice did these abilities, taken in their 
entirety, account for over 25% of the vari- 
ance in tracking. As seen in Figure 11, 
only two of the identified abilities show any 
systematic relation to tracking perform- 
ance. These are Spatial Orientation, listed 
under Observing Responses, and Multilimb 
Coordination, listed under Motor Re- 
sponses. However, it should be noted that 
the contribution of the observing response 
factor decreases through practice while the 
motor response factor increases in impor- 
tance. 

The major question now becomes: “Why 
were these abilities not more effective in 
accounting for variability in tracking per- 
formance?” The need for some answer be- 
comes more obvious when it is realized that 
in a previous study ( Fleishman & Hempel, 


1955) using a different criterion task (the 
Discrimination Reaction Time Test— 
DRT), over 60% of criterion task per- 
formance could be accounted for at some 
practice stages using the same kinds of 
abilities as in this instance. This 60% is 
as opposed to less than 25% within this 
study. 

A possible answer may rest both with 
the nature of the tests within the reference 
battery and the nature of the criterion task. 
Considering the criterion task first, it has 
been noted that this tracking task was quite 
complex due to the dual display, the sec- 
ond-order control system characteristics, 
and the requirement for dual-error nulling. 
Another way of viewing this task might be 
that the subject was required to maintain a 
highly complicated and unstable system in 
a state of equilibrium. With tasks in- 
vestigated previously such as the DRT, 
Rotary Pursuit, and the Complex Coordi- 
nation Test, it appears reasonable that differ- 
ent abilities might be required for mastery. 
For example, with the DRT the subject ob- 
serves a visual stimulus which differs on 
successive presentations only in terms of 
four possible configurations. The subject 
reacts by pushing one of four toggle switches 
as quickly as possible. For this task it 
appears logical that such abilities as Spatial 
Orientation, Reaction Time, and Rate of 
Arm Movement would be important. 


The fact that a similar set of abilities did 
not account for a substantial amount of the 
variance in performance in the present in- 
stance may be explained if the task is ex- 
amined in light of the three required re- 
ponse arcas. The observing response in 
this task requires that the subject time- 
share between two separate features of the 
over-all display. The prediction response 
requires that the subject develop the capa- 
bility of predicting target position at some 
future time. When it is considered that the 
display actually is an error signal produced 
by improper control movements, it is ap- 
parent that prediction of target position 
would require a rather complex ability. 
This is particularly true since two dimen- 
sions of the control system involved second- 
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order controls. Thus the prediction of 
target position might involve such hypo- 
thesized abilities as the ability to perform 
double- and higher-order differentiations 
while compensating for built-in control Sys- 
tem lag effects. Certainly proficiency in 
this task would require a different set of 
abilities than would be found with the 


DRT or the other tasks investigated previ- 
ously. 


The particular selection of tests for the 
reference battery used in this instance 
represents an attempt to continue with the 
same type of tests used so su 
previous investigations em 
Criterion measures. 


ccessfully in 
ploying different 
The lack of success of 
this comprehensive battery of tests in de- 


fining a set of abilities which would relate 
to tracking proficiency warrants 
look at the tests selected. Tt appears that a 
set of abilities are important with this type 
of criterion task for which appropriate 
Predictive tests have not been developed. 
Further Work is required for the construc- 
tion of such tests, Thus, if the criterion 
task | is known to require time-sharing 
activities, the reference battery should con- 
tain tests designed to measure an hypo- 
thesized general time-sharing abilitv. There 
are no known reference tests which miu 
be used to measure such an ability, 
| In short, While the reference tests used 
in this study reaffirmed а number of basic 
human perceptual-motor 
abilities could not be related to any 
siderable extent to tracking pe 1 
A complex tracking activity apparently re- 
quires abilities for which appropriate 
reference measures are not available at 
present. These abilities, further, do not fall 
within the area customarily called “Motor 
Abilities.” Purely motor abilities 


а closer 


these 
con- 
rformance, 


do not 
determine | individual differences in ad- 
vanced tracking proficiency. The important 
abilities appear to fall within the areas of 


“Observing Responses" and, 
greater extent, “Prediction Responses.” 
Further studies, obtaining basic measures 
within these areas, should be abl 


propably to а 


* fo con- 
tribute additional information concerning 
the nature of a complex tracking activity, 
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SUMMARY AND CONCLUSIONS 


This investigation was designed lo 


pro- 
vide inform 


ation concerning the kinds of 
abilities which best predict per 
а complex tracking tas 
Was given to the manner in 
derlying abilit 
a function of 
task. 


formance in 
Consideration 
which the un- 
у pattern exhibits change as 
increasing proficiency in the 
The relation between carly component 
performance measures and terminal pro- 
ficiency in the tracking task also was ex- 
amined, 


V battery of reference tests containing 
both apparatus and printed tests of known 
factorial structure was administered to 203 
subjects. Following this cach subject spent 
a total of 17 sessions mastering the tracking 
task. Scores on the reference tests and 
various learning levels of the tracking task 
were analyzed through factor analysis tech- 
niques. Fifteen ability factors were identi- 
fied and were related to the different stages 
of practice in tracking. 

The following conclusions are drawn 
from the results of this study: 

1. The kinds of perceptual-motor abili- 
ties which previously have been found to 

'e related to performance on laboratory 


CTitenon t 


3$ do not account 
stantial amount of the variance in perform- 
ance with the present task. Tt is hypothe- 
sized that the complexity and particular 
dynamics of the tracking task used in this 
study require different kinds of abilities. 
Such abilities as Rate of Arm Movement 
apparently аге not as important to the 
mastery of complex tracking activities as 
with laboratory Psychomotor tasks, In 
tracking, the important determiners of pro- 
ficiency may be found in those abilities 
used in the analvsis of target motion and 
in the prediction of appropriate re 
movements, rather than in abilitie 
directly to control manipulation. 


2. Initial Component and total scores 
taken from the tracking task itself do not 
predict advanced proficiency levels; Tf one 
must choose between intratask and inde- 
pendent me sures, 
Measures give д | 
vanced tracking 


2 \ 
ior any sub- 


sponse 
х related 


the independent test 
etter prediction of ad- 
proficiency, 


v 
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APPENDIX B 


Description or Tests IN REFERENCE BATTERY 


Printed Tests 


1. terial Orientation. The subject is re- 
quired to match the cockpit view of the hori- 
zon that corresponds to the airplane from 
which this view would be scen. 


2. Complex Movements. Each. problem pre- 
sents a vertical line and above it a horizontal 
line. On each of these lines is a circle and a 
dot. Below the lines is a grid of 49 squares, 
one of which contains an "S." The examinee 
indicates in the grid from the letter S the 
direction required to “move” the dots on both 
lines simultaneously into their respective 
circles. 

3. Coordinate Movements. Each problem 
presents a series of arrows which vary in 
curvature and represent different directions of 
movement. The thickness of each arrow repre- 
sents distance of movement. To the left of 
these arrows are a dot and circle. The sub- 
ject selects the one symbol (arrow) that will 
move the dot inside the circle. 


4. Directional Control. Each. problem. pre- 
sents nine boxes side-by-side with a target 
circle above the middle box. Also pictured is 
ап arrow whose head indicates direction and 
Whose length indicates distance of movement. 
The subject indicates from which box it must 
be launched to have its head reach the target. 


5. Printed Discrimination Reaction Time. 
Each item simulates a setting of the apparatus 
test of the same name except responses are 
Made by pencil in one of four slots on answer 
Sheets, ^ The slots are arranged in the same 
Up-down, left-right pattern as are the toggle 


‘Witches on the apparatus version. 


6. Following Directions. Two panels of 
Уса! aircraft display instruments are pre- 
ented to the subject. As the subject reads 
x. text he underlines certain. words depend- 
8 upon relations among the instruments and 
t" directions appearing within the text. те 
rapi 15 designed to measure ability to work 
E under the pressure of complex direc- 
Ons, 


at ‘forced Landings. Aircraft are Donee 
represent, altitudes above a number of prm 
shown nting landing fields. Arrows are а s 
ON the display representing wind direc- 


tion and magnitude and up or down draft cur- 
rents. The subject must assess the situation 
and select the best landing field in terms of 
prescribed criteria. 


8. Formation Visualisation. The subject is 
presented with the top and side views of 
silhouettes of aircraft flying in formation. He 
then must select the correct alternative from 
among four front views of this same formation 
flight. 


9. General Mechanics. Printed items re- 
quire practical mechanical information dealing 
with the use of operation of familiar mechani- 
cal methods and devices. 


10. Instrument Comprehension. For each 
item which presents views of cockpit instru- 
ments the subject must determine the proper 
position and orientation of an airplane. 


11. Mechanical Comprehension. The subject 
is presented drawings of mechanical devices. 
Пе then must answer a series of questions con- 
cerning the operation of each device. 


12. Pattern Comprehension, А series of 
drawings is presented which require visualiza- 
tion. of relationships between components of 
solids ап their unfolded, flat projections. 


13. Planning A Course. The subject is pre- 
sented a drawing of a maze-like arrangement of 
boxes. Starting at one end the subject must 
trace his way through the boxes, responding 
to coded instructions as to direction of travel 
which are presented within the various boxes. 


14. Signal Interpretation. The subject is 
presented with a picture of a row of ships, jn 
ship having an arrow beneath it. The suhjec 
traces from ship to ship in accordance with pe 
pattern of flags being flown by each Hp mara 
problem continues until the subject returns to 
a ship used previously. 

The subject is pre- 


& © 7 "ientation. E 
15. Spatial Orientatic айги. "This 


sented with a large aerial phot 
is followed by several small photographs, — 
presenting an expanded view of one section a 
the large photograph. The subject must ш i 
nect cach small photograph with the proper 
position within the large photograph. 


N 
© 


16. Speed of Identification. Pictorial items 
are presented in which the silhouette of an 
object must be identified when it is rotated and 
imbedded in a group of similar silhouettes. 


17. Stick and Rudder Orientation. A series 
of pictures is presented to the subject show- 
ing changing views through the windscreen 
of an aircraft. The subject is required to 
recognize the corresponding change in the posi- 
tion of the aircraft and to indicate the move- 
ments of stick and rudder required to produce 
such a change. 


18. Verbal Comprenhension (Vocabulary). 
This is a typical vocabulary test requiring the 
matching of words and definitions. 


Apparatus Tests 


21. O'Connor Finger Dexterity. The sub- 
ject is required to pick up three small pins at 
a time from a tray of pins with the preferred 
hand and place them three at a time in a 
small hole. He must fill a series of small 
holes in this manner as fast as possible. Score 


is the number of pins placed. One 5-minute 
trial. 


22. Purdue Pegboard (Summation Score). 

Right hand: The subject is required to 
place a number of small pegs individually in a 
series of small holes as rapidly as possible 
with the right hand. Score is the number of 
pegs placed. One 30-second trial. 

Left hand: Same except with the left 
hand. 

Both hands: The subject is required to 
pick up two pins at a time, one with each 
hand from different trays and place them 
simultaneously in two different holes. Score 
is the number of pegs placed. One 30-second 
trial. 

Assembly: The subject is required to 
make as many completed peg-washer-collar- 
washer assemblies as possible in the time al- 
lowed. Score is the number of assembly com- 
ponents completed. One 30-second tria], 

A summation of the four 


Р 0 scores was 
used in the data analysis. i 
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(50.) Verbal Comprehension (Background for 
Current Affairs). This is an informational 
test covering current, recent, and historical 
events. 

19. Visual Pursuit. From a series of mazes 
or irregularly curved lines the task is to trace 
each line visually from its beginning to its 
proper termination point. 


20. Visualization of Maneuvers. A picture 
of an aircraft is presented. This is followed 
by three descriptions of changes in the aircraft 
position. The subject then selects from among 
five alternatives that picture of the airplane 
which represents its final position following 
the described maneuvers. 
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23. Minnesota Rate of Manipulation. 

Placing: The subject is required to 
place 60 cylindrical blocks in the proper holes 
as rapidly as possible. The score is the num- 
ber of blocks placed. Two 35-second trials. 

Turning: The subject is required to re- 
move the blocks from the holes with one hand, 
turn them over with the other hand, and re- 
place them in the same holes, moving from 
block to block as rapidly as possible. Score 
is the number of blocks turned. Two 40-sec- 
ond trials. 


24. Two-Plate Tapping. The subject is re- 
quired to strike two adjacent metal plates with 
a stylus as rapidly as possible. He strikes the 
plates successively; that is, first one and then 
the other, making as many taps as possible on 
the plates in the time allowed. The number of 
taps is recorded on counters. Three 30-second 
trials. 


| 


25. Precision Steadiness. The subject is 
Seated before a long, rectangular box-like ap- 
Paratus containing two openings. Each open- 
ing is the entrance to a straight passageway 
Which the subject must negotiate with a long 
stylus. He moves the stylus forward at slightly 
below shoulder height and at arm's length. 
He must move the stylus slowly and steadily 
away from his body trying not to hit the qo 
of the cylindrical passage. As he reaches the 
“nd of the passage he strikes à contact и 
and Withdraws the stylus, again trying to avoid 
litting any part of the passageway, He then 
negotiates the second passageway. Two bt 
Ue negotiations constitute à trial. а = 
did the number of contacts and clocks La 
trial. the number of times in contact У? 

$, no time limit. 
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26. Direction Control. Patterns of lights E 
appear in the form of a cross (to indicate 5 
the "heading" of an airplane). The pattern 
may appear in different parts of the display а 
panel and may represent a small ог large | архе 
cross, depending upon the direction of head- c Po gre d 


jp———————————————— — 


ing. The examinee must use both hands con- | | | > 

currently in manipulating the proper combina- aw. 8 

tion of switches or switches and buttons. А PX “ә \ 

new aircraft appears upon proper manipula- à S 

tion. Four 2-minute trials. be _ à * \ 
27. Control Sensitivity. ‘The subject is re- EET ETT 
quired to match the position of a red light EEE 
with a corresponding green light. The position 

of the lighted green light is controlled by a 

highly sensitive control stick. A slight move- 

ment of the stick to the right displaces the 

lighted green light to the right and, similarly, № 
for left movements. When the subject has | 
matched the two lights and held this position ! eod 
for .5 second, the red light moves to à new | \ 
position and the subject again proceeds to ا‎ ——À 
match it. Score is the number of completed | ` 
matches in four l-minute trials, ME CE i 


28. Ten Target Aiming. The subject is 


holes arranged at equal intervals in an elliptoid 
pattern. Behind each hole can be seen a 
circular target. These targets vary in size 
from hole to hole. The subject is required to 
strike at these targets with a stylus moving 
from target to target around the pattern of 
targets in a clockwise direction. He is in- 
structed that both speed and accuracy count 
and that he must try to hit as many targets as 
possible, striking only once at a target. Error 
counts are recorded each time the subject 
strikes the outside of each hole or inside 
around the target area. Correct counts are 
scored for hits within the target area in each 
hole. Six 30-second trials, x - 


seated before an upright panel containing 10 | 7 


oo ™* 
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29. Bimanual Matching. A red row of 
stimulus lights has a parallel row of green 
lights above and below it. The subject must 
match lights simultaneously in each row of 
green lights with the red lights through move- 
ments of two handles. The movements of 
each hand sometimes correspond and some- 
times are antagonistic to each other and bear 
a complex relationship to the position of the 
stimulus lights from setting to setting. Score 
is the number of simultaneous matchings made 
and held for .5 second during each of two 
2-minute trials. 


30. Track Tracing. The subject is required 
to negotiate an irregular slot pattern with a 
T-shaped stylus held at arm's length. Score 
is the number of errors (contacts with the 
back, top, and sides of slot) during four at- 
tempts. No time limit. 


31. Two-Hand Coordination. The subject 
attempts to keep a target follower on a small 
target dise as the target moves irregularly 
ànd at varying rates. Movement of the target 
Slower to the right and left is controlled by 
9ne lathe-type handle, movement to and from 
'S controlled by the other. Consequently, 
Simultaneous rotation of both handles moves 
the follower in any resultant direction. Score 
15 the total time-on-target during four l-min- 
Ute trials, З 


| 
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32. Choice Reaction Time. The subject is 
confronted with a panel containing a central 
light. He has two levers, one for the right 
hand and one for the left hand, and two foot 
pedals. He responds appropriately to any of 
four stimuli as follows: 

red light—right lever forward 

green light—leít lever back 

buzzer—right pedal depressed 

bell—left pedal depressed 
The subject is allowed 3 seconds in which to 
respond. The central light extinguishes when 
the correct response is made. 


33. Complex Coordination Test. The sub- 
ject is required to make motor adjustments of 
an airplane-type stick and rudder in response 
to successively presented visual patterns. A 
correct response (movement of stick and rud- 
der controls to proper positions) is not accom- 
plished until both the hands and feet have 
completed and maintained the appropriate ad- 
justment. A new pattern appears as each cor- 
rect response is completed. Score is the num- 
ber of correct responses completed in a given 
test period. Four 2-minute trials. 


34. Pursuit Confusion. The subject at- 
tempts to keep a stylus on a variable Speed 
target as it moves through a diamond-shaped 
slot. The task is complicated by the fact that 
the entire target area is visible only by mirror 
vision. Score (corrects) is the time-on-target 
during the six l-minute trials. Error score is 


the amount of time the subject is in contact 
with the sides of the slot. 


PREDICTORS OF COMPLEX TRACKING BEHAVIOR 


35. Motor Judgment. The subject is con- 
fronted by two adjacent discs rotating at a 
constant speed. Each disc has black and white 
sections on its perimeter. Between these discs 
is a pointer whose speed of rotation the sub- 
ject can control. A forward movement of this 
stick slows the pointer and a backward move- 
ment speeds up the pointer. The subject can- 
not stop the rotation of the pointer completely 
and he can exert no control over the two 
rotating discs. The subject is required to 
manipulate the control stick so as to make as 
many revolutions of the pointer as possible 
without crossing the black areas on the ro- 
tating discs. To do this properly he must in- 
tegrate his estimates of speed of each disc, 
the pointer, and his own control movements. 
Score is the ratio of the number of errors to 
the number of pointer revolutions during four 


1-minute trials. 


36. Discrimination Reaction Time. The sub- 
ject manipulates one of four toggle switches as 
quickly as possible in response to a series of 
visual stimulus patterns differing from one 
another with respect to the spatial arrange- 
ment of their component parts, е.5., position 
of a lighted red lamp relative to а lighted green 
lamp. Score is the accumulated time of те- 
Sponse for four series, each series containing 
20 reactions for each stimulus pattern (sub- 
ject must respond within 3 seconds). 


37. Rotary Pursuit. The subject pec 
9 keep а prod stylus in contact with a sma 
Metallic target set in а rapidly revolving 
PRonograph-type disc. Score is the total time- 


On. e 
“target during five 20-second trials. 
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38. Reaction Time Test. 


Visual: The subject keeps his finger on 
a button, depressing it as rapidly as possible 
in response to a single amber light before him. 
A click provides him with a ready signal be- 
fore each light stimulus is presented, with 
the fore period (between click and light) 
varying in a random order from .5 to 1.5 sec- 
onds. Score is accumulated reaction time for 
a series of 20 reactions. 

Auditory: Same procedure except that 
the response is to a buzzer. 

Jump Visual: Same procedure except 
that the subject does not keep his finger on 
the response button but keeps it on a cross 6 
inches from the response button. He must 
move his hand to the button as rapidly as 
possible as each light stimulus appears. 


Jump Auditory: Same except response 
is to buzzer. 


39. Steadiness Aiming. The subject must 
keep a delicately balanced stylus centered in a 
small hole. Any contact with the sides of the 
hole is recorded on a clock. The score is the 


number of seconds in contact. Six 40-second 
trials. 


40. Single Dimension Pursuit. 
makes compensatory adjustments 
movements) of a control wheel 
keep a horizontal line тат 
deviates from center in irregular fashion. The 
control wheel 15 damped pneumatically, intro- 
ducing a lag into the system, Store is the 
time the horizontal line is held ша null Osi- 
tion during the four I-minute trials, = 


The subject 
(in and out 
in order to 
null position as it 


Ым—&— жа —.- 
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41. Rate Control. А vertical target line 
moves back and forth across a curved scale 
with frequent changes in direction and rate 
of movement. The subject attempts to keep 
a thin pointer in coincidence with this line by 
adjustive rotary manipulations of a knob con- 
trol. Score is the total time the pointer and 
target line are in coincidence during eight 
I-minute trials. 


42. Rotary Aiming. The task is to strike at 
à series of buttons arranged in a circular pat- 
tern on a horizontal panel, going from one but- 
ton to the next as rapidly as possible. Score 
is the number of strikes in four 30-second 
trials, 


43. Rudder Control. 
Center Target: 
Mock airplane cockpit w! 
"Cep aligned with one of Я 
3 g с ы wn 
15 they come on in front ot him. "s us 
Weight throws the seat off-balance aint by 
"Dplies and maintains proper correc only 
Means of foot pedals. In this <= total 
re center target is used. Score 15 ps ped 
Ime the cockpit is held aligned = i 
er light during three 30-second trials. as 
al Triple Target: Same pirogedura ro- 
move except the subject must utilize app 


quse Pedal control to shift the o з 
ÉR 3 ‚ опа SRM. 
doi, ight to another as these d time the 


intervals, Score is the total | я 
A > ‘Beet r- 
it is lined-up with the prope! light du 
e 112-second trials. 


The subject sits in a 
hich he attempts to 
three target lights 
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APPENDIX C 


Brock DiacramM or TRACKING DEVICE 
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APPENDIX С 


UxnorATED Factor Loapincs 


JAMES Е. PARKER, JR. лхо EDWIN A. FLEISHMAN 


FACTOR 
VARIABLE №. 1 W ш ow v м уп үш IX X Хх xu хш xiv XV h 
Aerial Orientation 1 60 -25 04 -06 -15 -01 - 08 10 21 -i7 -05 0 05 05 55 
Complex Movements 2 p бИ 03 в м 05 0 -02 06 -0 п (09 22 -08 -03 45 
Coordinate Movements 3 5 -18 18 13 05 13 02 -05 16 16 05 05 0 01 05 49 
Directional Control 4 50 OF © A (9 15 i -M 23 16 02 фм. 
Discrimination Reaction Time 5 50 и 0 24 -13 10 17 03 A13 -l6 — 15-01 y -05 00 4 
Following Directions 6 40 -3 21 11 -0 07 06 -06 -19 _ 10 12 02 -04 -03 15 4. 
Forced Landings 7] 5 з р р 09 09 0 0i 15 -08 -0? -09 -36 07 00 51. 
Formation Visualization 8 6 -3 09 -03 15 -08 -21 13 -09 -l1 02 -08 08 03 -09 71 
General Mechanics 9 64 -19 -13 -19 00 -28 -10 05 -10 -08 -06 04 -13 -15 13 66 
Instrument Comprehension 10 70 -25 05 0% -0l 00 0 03 -01 -10 -12 -08 07 09 01 61 _ 
Mechanical Comprehension ll ^8) -33 06 5 м д 8 05 04 0 -02 -03 -17 -15 07 62 _ 
Pattern Comprehension 12 65 -25 -04 05 18 -12 -27 18 06 -06 10 08 00 -10 -06 67 
Planning A Course = ә 07. 04 17. 15 "4 ات 0 ا1ل‎ 27 03 -16 05 92 
Signal Interpretation 14 4 -22 08 м :09 7 2 o 02 O -i4 16 -1 м -2 5l 
Spatial Orientation _ 15 A? 09 —10 b $00 09 -13 -06 -28 -04 м 05 06 15 -03 40 
Speed of Identification 16 ML 18: 523 36 Чә ON 18 O -24 —-02 17 dy) им 
Stick and Rudder Orientation D 85 - > 00 Of -2 -0 12 13 08 op O7 -02 68 
Vocabulary 18 22 -35 31 07 -34 -38 01 -9 14 Е -l 5 3 
Visual Pursuit 07 -06 -10 -06 -04 13 6 
'isual Pursuil 19 61 -05 -01 10 02 12 02 -10 -22 16 12 02 -01 -06 -15 52 
Visualization of Maneuvers 20 63 -29 OL OL -12 12 -22 00 -07 -07 -02 62 
Finger Dexterity 21 25 30 = = 05. 16.05 TM 
Unus РЕГ 33 34 9 5 18 -19 -16 01 -03 10 18 -01 -08 -12 -27 4 
Е = 5 15 -06 -18 -03 - Е Е Е 
Rate of Manipulation (placing) — 23 14 51 -05 37 05 -12 -18 0 v e s D а и F 
Rate of Manipulation (turning) 24 28 45 -11 26 -04 -06 -13 -08 -04 -03 -277 п 01 ES a 49 
Two Plate Tapping 25 —25 45-08 16-12-01 -03 1208 02-24-01 -0i 06 06 42 
Precision Steadiness (errors) 26 40 20 04 -08 08 -11 08 20 -13 23 15 05 -0 : 
Direction Control 27 57 -10 -09 07 05 0 00 08 13 -07 -03 -15 я 03 4 E] 
Control Sensitivity 28 39 35 08 -1 -18 02 11 -02 -12 -31 -11 -04 -04 m 
Ten Target Aiming Cerrors) 29 -03 25 34 -09 -49 2 -25 04 13 03 00 06 -13 ^s 12 aI 
Ten Target Aiming (correct) — 30 18 46 20 -11 -35 17 -23 28 -03 09 12 6 0? -05 E: 
ИЕ ; 31 51 15 -23 -10 -Ol п 15 08 -07 08 -02 -03 -06 08 06 © 
errors) 32 46 20 -06 -08 11 -22 15 21 -05 (07 09 
Two-Hand Coordination 33 51 15 -20 -21 -0 [EE 
ә : -20 -21 -0] 09 08 6 08 07 м -3 -01 -08 -13 4 
a Time 34 д 24 23 V -05 - 21 19 04 -21 -0 -0 00 14 -16 A 
Complex: Coordinator 35 59 24 -01 -9 -09 04 6 02 -i7 -10 -05 -12 02 -09 -08 54 
te nfusion (errors) 36 50 WD 3 20 бй @ 04 6 п Ө 16 1 20 
Mor ЕП (correct) 37 48 15 -29 -10 -06 03 07 -23 08 -i8 l0 m7 EU i и 
Discrimination aac S IP 1a 522 x оц йз. ы ш.и 18 T ee. 
оп Reaction Time 39 52 18 10 -03 м 09 1 05 05 -03 27 
Rotary Pursuit 40 -03 -03 15 -24 14 03 04 07 05 45 
Visual Reaction Time 41 = F E ED a E E ee ШЕЕ ШШ ШЕШ s 
T = E | Т ЖШТ 
Auditory Reaction gus 42 09 26 35 -17 29 03 03 -08 п 0 4s 2 15 -09 à 0l 45 
p Visual Reaction. Time 43 l1 38 36 -14 12 03 -18 -28 00 -04 16 -07 -18 01 40 
Jump Auditory Reaction Time 44 22 33 31 -32 24 -12 -15 -23 -03 NL. crop. Usu 
Steadiness Aiming Ao 14 P 2010 -3 (0 —0 (06 22 -001 -05 -08 м 13 05 54 
Single Dimension Pursuit 46 24 14 -0 -09 -M 01 -13 -32 = E. Е E ии 8 
Rate Control 4 45 20 05 -27 00 -02 09 -15 -M 12 -08 -0l b -05 -08 26 
Rotary Aiming 48 07 4 1l 14 03 -I3 06 00 02 21 -00 -13 0 D -w 42 
Rudder Control 49 55 23 -25 -25 -15 -0 1 03 -03 13 0? B: sop. Ай 80 
Current Affairs 50 09 -29 33 21 -25 -41 15 -12 02 -065 04 001 02 -06 -03 5 _ 
Note.—Decimal points have been omitted, 04 23 08 (09 06 59 
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APPENDIX H 
Roratep FACTOR Loapincs 
FACTOR 
VARIABLE N. I n ш у у VI уп мш х хм хп хш XIV ху лт 
Aerial Orientation — — 1 52 0$ Ol 0 -I12 25 05 05 20 26 и 13 -07 
Complex Movements (2 16 10 -15 0 13 01 34 30 13 04 04 1 D 
Coordinate Movements 3 36 00 -05 п -0l 18 40 6 м 04 -03 47. 
Directional Control RH sy 19 = 05 =07_ 04 30 -N 11 20 -2 45 
Discrimination Reaction Time — 5 07 — 30 0 15 -06 13 25 01 02 -04 1 45 
_ Following Directions — — 6 27 -10 -Ol_ 0 04 09 27 06 20 -16 -08 40 
` Forced Landings — 7; 29 12? - 0 08 13 10 -03 35 20 -15 48. 
"Formation Visualization — — 8 33 -ll -03 -03 15 61 19 -06 8 04 12 70 
~ General Mechanics — S 3 % 4 2 19 з 0 02 4 о a 6 
_ Instrument Comprehension 10 4 09 -6 0 08 33 14 o 16 -05 13 59 
_ Mechanical Comprehension n 3a -03 07 -06 11 38 17 -06 47 03 12 58 
Pattern Comprehension m 12 16 17 58 32 00 27_ 16 M С 16577 
“Planning A Course 13 31 13 12 23 22 24 -6 -6 5. 
_ Signal Interpretation — 14 45 05 зз 25 19 01 05 4 
Spatial Orientation —— — 15 3 05 24 -00 10 -03 (05 01 38- 
Speed of Identification 16 09 17 08 10 00 5 
Stick and Rudder Oriental 1753 -w 05 02 01 dE УЙ 
NY . 18 -18 -18 13 -04 -03 13 02 60 
— 19 35 03 00 07 -02 16 18 50- 
"Visualization of Maneuvers 20 46 — 08 07 05 8 i - 0 6&6 
. FingerDexterily — Оп -0 0 -0 0 05 49 16 3% 
. Purdue Pegboard _ 2» 14 3 -13 33 07 43 -02 5353 
_ Rate of Manipulation (placing) _ —23 -10 35 -13 38 T E 4 49_ 
. Rate of Мапіри! (turning) _ m в 33 -13 A 04 17 =. EJ 
в 734 08 08 937. 
-10 00 26 45 
A 01 -M 24 50 
- Ten Target Aiming, (eror) — MER d 1i = = 5 ^ 
. Ten | Target Aiming (correct) mI 05 9 0 4 
31. 42 з A 43_ 
= -16 d8 17 a4 35. 
-E ет 25 o v 15 33 
"m hoice Reaction Time — = 07 20 02 38 53 
'omplex. Coordinator = 2 -04 ми 03 46 
_ Pursuit Confusion (errors) 36 -08 28 18 16 49 
02 -M 13 20 27 
07 08 из 
-05 08 04 12 53 
21 13 -02 09 4. 
42. i0 15. 05 41. 
-02 -M 10 -14 43. 
13 1 -06 20 1 57. 
40 _ 08 | 05 05 14 28 
-% 12 11 -001 08 -03 10 20 25. 
1 15 03 01 0 -2 0 25 4 
03. 1 I1 -21 y 0 ?6 


~ Current Affairs 
Integrated Error Score - 
~ Integrated Error Score 
Integrated. Error Score _ 
~ Integrated. Error Score _ 
Dlegrated Error Score 
ntegrated Error Score 
NLIS Error Score 
м Error Score _ 
РШЕ Error Score _ 
быс Еггог $соге 
~ Horizont Error Score 
А rontal г Score 

tizonta] Error Score _ 


топ; 
Zontal Error Score 
Orizi а 


tal Error Score 5 


09 0 03 18 03 9 08 40 5 
19 15 -05 -0 19 -08 -12 0 58. 
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APPENDIX H—Continued 


FACTOR 
VARIABLE No I по d м V м Vi vil IX X XI XH хи XIV XV 
Horizontal Error Score 6 6 21 14 02 10 16 0 04 00 OS бї 13 02 (09 I 13 
Horizontal Error Score — 7 67 21 03 05 -05 03 т 14 03 16 00 07 м -03 -1 19 
Horizontal Error Score 8 68 14 03 00 -05 -02 10 00 оз 04 м 10 07 -07 -03 20 
Horizontal Error Score 9 59 13 12 (04 -05 19 06 -05 19 04 -10 08 10 03 02 16 
Horizontal Error Score 10 70 19 -01 03 -07 10 09 1 -15 04 02 0 10 03 -0l 26 
Vertical Error Score 1 7 20 0 -10 0 2 10 02 -03 -03 -02 08 10 -15 -03 14 
Vertical Error Score 2 — 72 17 07 œ 06 03 в 02 00 o o 05 œ -08 0 15 
Vertical Error Score 3 73 07 0 05 00 -04 16 09 -05 -06 0 09 09 0 0l 08 
Vertical Error Score 4 74 13 08 03 -03 06 06 24 0 -03 05 17 05 -09 -15 05 
Vertical Error Score 5 75 18 18 -08 10 o 05 19 6-0 10 02 -02 -03 -03 15 
Vertical Error Score 6 76 19 09 2 4 17 10 13 -0 07 06 10 o 08 -04 13 
Vertical Error Score 7 7] 08 03 05 -07 м (09 24 ©% 05 06 12 -07 -05 -01 23 
Vertical Error Score 8 78 10 00 04 -03 o 00 01 2 02 0 13 08 02 04 11 
Vertical Error Score 9 7901 о 02 -02 04 зи -0 o 2 18 04 16 06 16 
Vertical Error Score — 10 80 10 05 -02 -03 05 15 -06 -19 00 03 -08 08 10 03 26 
Sideslip Error Score 1 81 12 -0l 04 00-01 13 © п 09 -02 02 02 18 -0 09 
Sideslip Error Score 2 82 08 -08 -04 à 02 -08 16 -05 з 14 -03 м 02 08 -15 l 
Sideslip Error Score 3 83 03 -10 06 -03 01 -06 9 м о © (09 05 15 0l -0l 
Sideslip Error Score 4 84 05 -08 -02 -03 -05 01 -0 -03 13 01 -0 06 00 -05 11 
Sideslip Error Score 5 85 1l -04 06 15 02 08 -20 -04 (04 -01 -03 №9 10 0 06 
Sideslip Error Score 6 86 0 03 08 à 05 -05 18 -01 -œ -07 04 0 2 м 03 05 
Sideslip Error Score 7 87 05 05 06 01 -03 25 М -0 œ -00 00 18 п 03 18 
Sideslip Error Score 8 88 и 0 08 02 15 2 17 œ 00 (05 04 22 -04 Ol 16 
Sideslip Error Score 9 89 12 -01 12 -03 12 0) V -0 05 08 йй 19 i4 я 
Sideslip Error Score 10 90 0% 0 % -ll 13 16 10 2 -95 бозуй 0б И OT 21 
Time-On-Target Score — 1 31. 22 6 00 (05 15 08 б o -16 -00 02 13 [2 -03 
Time-On-Target Score — 2 жп 08 o 03 юм 05 0 05 11 09 -08 08 
Time-On-Target Score — 3 з 1 12 в -01 и 0 721 -0) -07 а О ро 0 б 
Time-On-Target Score 4 94 19 08 -02 -01 17 -0i 18 10 03 07 03 07 00 -18 09 
Time-On-Target Score — 5 95 30 14 01 02 06 02 25 17 м 06 -02 -05 я 
Time-On-TargetScore — 6 5 27 12 00 05 15 10 10 00 -0 7 03 ва ть 
Time-On-Target Score — 7 57 19 10 05 -04 ^10 13 12 œ o o 10 в 07 4% 23 
Time-On-Target Score — 8 98 17 02 11 -06 08 18 15 02 05 11 13 20 03 -02 22 
Time-On-Target Score — 9 9 п B 07 03 16 14 15 0 -0 O йй 2 i 1 2 
Time-On-Target Score 0 100 25 05 03 -05 19 17 -0i -i9 -08 6 66 13 16 05 3 
Note.—Decimal points have been omitted. 
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SOME VARIABLES AFFECTING PATTERN CONCEPTION 


EVA 1. SHIPSTONE! 
Harvard University 


ie approach to the study of concept 
formation is based on the assumption 
that there is an implicit conception of 
structure (patterning behavior) in adult 
human beings and that this can be classified 
into hierarchically organized categories. In 
order to give a more definite conception of 
what this means, we begin with an anno- 
tated protocol recorded for a single sub- 
ject. The procedure described in this 
sample protocol is slightly different from 
that used in the main experiments de- 
scribed below: it was chosen, however, in 
order to unravel for the reader in some 
detail the process of strategy construction 
in concept formation. This technique 15 
replaced by a simpler and more concise one 
in the experiments to be described later. 


AN ILLUSTRATIVE STUDY 
( Experiment Г) 


The subject sits at a table; across trom 
him is the experimenter. The subject has 
just finished reading the following instruc- 
tions: 
an army base was rce- 
ceiving important code information, one message 
at a time. This information had to be sorted 
for filing as it was received. Now suppose you 
were the officer on duty with this important job 
of receiving and filing this information. You 
will now be presented with the same aer 
lion, single message on single card, two cards 
at first, then one card at а time. You are to 
study the card and then decide how you are to 


file it. You can put the cards into as many pile: 
are free to move 


During the last war 


Or boxes as you wish and you 


.' The author, now teaching at Isabella. 1 hoburn 
йере, Lucknow. India, wishes to acknowledge 
‘er indebtedness to George А. Miller for ing 
m. research, Without his invaluable Les = 
тез could not have been written. Gratitude i 
| 50 expressed to Е. В. Newman, Chairman O 
le department, for liis suggestions 


around а card from one box to another апу 
time you wish. You will not be limited in time 
since the decisions you make are very important, 
but work rapidly. Read these instructions again 
until you understand them thoroughly and read 
them again in between if you should forget 
them. 

Any questions? 

Now, one further point. As you work with these 
cards, think aloud as if you were talking to your- 
self. Talk freely about what you are doing 
throughout your performance. т 

Any questions? 

“Неге,” s the experimenter, "are the first two 
cards.” The subject is handed two cards, 5" X 8", 
on which are typewritten: ZIR PAG ZIR ZIR 
PAG NEN; and ZIR PAG N PAG NE 


PAG NEN PAG NEN PAG NEN PAG NEN 
PAG ZIR NEN. Only three nonsense words, 


NEN, PAG, ZIR, arranged in various different 
orders, are typed on any of the cards; it is con- 
venient to abbreviate them, N, P, and Z. The 
subject looks at the two cards, laughs self- 
consciously, and places them separately on the 
table in front of him. 

He says, “It seems as though the main difference 
in the way things repeat is first according to the 
symbols that appear and then in the order in which 
they appear.” 

There is a silent pause. The experimenter says, 
“Now you will have to tell me when you are ready 
for the next card.” 

“Next card, please.” 

The next card contains the message ZPZN. The 
subject places it on the table by itself. All three 
cards are separately placed, so that the experi- 
menter asks, “Why did you place that card there?” 

“Oh, well,” the subject says, “there seems to be 
a difference in the lengths of the messages. And 
] don't have enough information to really decide 
on categories. І am just setting them aside until 
| gather a little more information." 

“The experimenter, who has seen some people 
base their entire classification on the length ot 
the messages, makes a special note that the first 
basis of classification mentioned by the subject is 
Vill he stay with this aspect, or will he 


` 


length. V 


look further? | 
gm card is ZPZZZZZZZZZZZZZPN. The 


subject says, “Now this card is more similar in 
length to one of the other ones 1 received, and 
it contains a large number of ZIRs as the other 


2 EVA I. SHIPSTONE 


one, so I will put these two together, possibly 
in the same category" He places together 
ZPZZZZZZZZZZZZPN and ZPZZPN;; since he 
does not combine ZPZZZZZZZZZZZZZPN and 
ZPNPNPNPNPNPNPZN, he seems to have re- 
jected length as a basis of classification. For the 
moment, apparently, his attention is caught by 
another aspect, the repeating unit ZIR. 

The next card is ZPNPZN. The subject reads 
it silently. There is a long pause. "Whatever you 
are thinking," says the experimenter, "please say 
it aloud. I know that is not easy." 

"Well" says the subject, "I just noticed that 

all of the five cards start with the same syllable 
and the syllables appear to be ordered. Similarly, 
the second syllables in all the cards are the same. 
A difference arises in the third syllable, so there 
is a possibility of arranging these cards according 
to the syllables. Alphabetically, in other words. 
But for the time being I will put this card in 
with the one which has similar length, that is, with 
the same number of syllables, like six or so.” 
He proceeds to file them by length, as follows: 
ZPNPZN and ZPZZPN are put together, ZPNPN- 
PNPNPNPNPZN and ZPZZZZZZZZZZZZZPN 
are put together, and ZPZN is left by itself. Al- 
though he is beginning to pay attention to the 
way the messages begin, and to consider the 
possibility of an alphabetical ordering, he rejects 
it in favor of a classification on the basis of 
length. He has destroyed his first tentative group- 
ing on the basis of a repeating unit and has 
returned to the earlier hypothesis. 

The next card is ZPPN, 
subject, "all the cards Start with the same two 
syllables. But I am going to set this card aside 
for a separate category for no other reason than 
that I want to gather more cards before I make 
a classification.” There is a pause. “Am I to 
assume that this is coded information? I mean, 
information like language information ?" 

The experimenter replies, “I am sorry I cannot 
answer that question for you. These are messages 
you are receiving and you are trying to file them.” 

“Messages are coming in and I am trying to 
organize them. Hmm .. . temporarily, I am 
going to rearrange the cards and put them in 
alphabetical order according to the Syllables, start- 
ing from the beginning." The subject seems to 
hope that something not presently obvious will 
appear as he tries different spatial arrangements 
of the cards, As he works at alphabetizing them, 
he says, "I've just noticed something that had not 
occurred to me before, that there are only four 
symbols." In fact, of course, there are only three. 
"Somehow," he continues, "in the beginning I 

assumed there were going to be more, So, now I 
am arranging them first of all according to sylla- 
bles, that is, the syllable order and the first three 
syllables, and then according to the lengths of 
the messages." Since the first two syllables are 
identical on all the cards tl 


"Again," says the 


hat he has seen, any 


attempt to alphabetize must take into consideration 
at least the first three syllables of the message. 


The experimenter notes that the subject is be- 
ginning to combine his rules in an effort to find 
some new basis for classification. Like most sub- 
jects, this one began with relatively simple rules— 
counting and alphabetizing—ready-made operations 
he has learned in other contexts; he transfers 
them bodily to the present task. For some reason, 
however, the results do not satisfy him and he 
is now beginning to search for something else. 
As he collects more cards, he wants to come to 
terms with them in a way that respects their 
peculiarities and particularities, not by some ready- 
made rule that might fit almost anything. | 

As shown in the chart below, the next card is 
ZPNPNPNPNPZN. The subject studies this 
card in silence for almost 60 seconds, looking back 
and forth between it and other cards spread out 
on the table before him. Then he makes a new 
observation, "I notice now that when I look at 
these cards in categories that there are units within 
each card that seem to repeat between cards. 
Like, for instance, I have two cards here both of 
which start with ZIR PAG NEN and end in 
ZIR_NEN, and in between they have the unit 
PAG NEN, but in slightly different numbers. I 
mean, one card will have more of these units than 
another card. It seems like relationships are 
starting to emerge.” 

The more complex rule that the subject has 
just discovered is one that the experimenter has 
watched many subjects come to; the experimenter 
calls such formulations "Initial-Middle-Final Rule.” 
Although the rule is here applied to only two 
cards to form a single category, it contains the 
germ of the filing system that this subject will 
use for all the remaining cards. Note that it is 
а rule designed to fit the present situation and to 
use the particular information provided by this 
Set of messages; it is not a rule that would apply 
equally well to all imaginable sets of messages. 


The order in which the messages were presented 
to the subject was: 


1. ZPZZPN 


2. ZPNPNPNPNPNPNPZN 
3. ZPZN 

4. ZPZZZZZZZZZZZZZPN 
5. ZPNPZN 

б. ZPPN 
7.ZPNPNPNPNPZN 

8. ZPZZZZZZPN 
9.ZPZZZZZZZZZZZZZZPN 
10. ZPPPPNZNNNNNP 

11. ZPPPNZP 

12. ZPNZNP 

13 ZPNPNPNPZN 

14. ZPPNZP 


15. ZPZZZZZZZZZZNP 
16. ZNZNNNNP 
17. ZPNPNPNPN 


PNPNPNPNPZN 
ZPZZZPN | е 
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19: ZPZZ 

20. ZPNP NPNPNPNPZN 
2L ZPPPPPPPPPPNZNNNP 
22. ZPZZZZZZZPN 

28. ZPZZZZZLZZZZPN 

24. ZPNZNNP 
25.ZPNPNPNPNPNPZN 

26. ZPZN 

27. ZPNPNPNPNPNPNPNPNPNPZN 
28. ZNZP 

29. ZNZN P 

30. ZPPPPPNZP 


At this stage the subject has received seven 
cards and has arranged them into three sets as 
follows, which we shall label as A, B, C, for 
convenience : 

A. ZPZZPN 

ZPZZZZZZZZZZZZZPN 
B. ZPNPNPNPNPZN 
ZPNPNPNPNPNPNPZN 
C. ZPPN 
ZPZN 
The seventh card, ZPNPZN, is left isolated for 
the time being. 

The next card is ZPZZZZZZPN. “Again I see 
the same thing within cards,” the subject says, 
apparently with some satisfaction. “I see units 
repeating in the cards within each class. For 
example, there is an end unit, two end units, 
in fact, and the intermediate units are the same, 
Just varying in number. In other words, 1 count 
the number of ZIRs in the middle unit in both 
cards, It may be five in one case and six in 
‘nother, and this holds for all my cards, I think. 
In other words, in all cases the ends and the 
beginnings are the same. It is the middle that 
Varies,” 

ZZZZZZZZZZZZZZPN. 
he subject says, “This classification seems 10 be 

Working out very nicely, so I will continue.” 

_ The next card is ZPPPPNZNNNNNP. There 

15 а long silent pause; clearly, something "e has 

appeared. The experimenter reminds the subject 

Ю keep talking. 

“ 

АП right he says, 
“ard. I am trying to see W 
At in. | am looking at the 
ae again I see the same 
SS I notice now, as а si 
eo to have only three basi 
AS of these categories is divid 
8 consist of cards having the 
ing sidings and йе ане п sa ра 

n number." пе subjec aces Mus 
ZELPPNZNNNNNP, together with ZPP 
"i next card is ZPPPNZP. eon Ro 
Nou attern. I mean to collect more = 4 
more Jt continues. And as soon as 1. d dus 
to as cards to form my third category . am esci 
able | if there are any more relationships E ise 

*tween the major categories themselves. 


„„ The next card is ZP 


“now we have another 
hat category this will 
first three syllables 
pattern emerging, I 
side observation, we 
c categories. And 
led into subclasses 
same beginnings 


diate units, va 4 
d, 


The next card is ZPNZNP. “Oh,” says the 
subject, "now I notice that I made a mistake. 
I see that whereas I thought there were three 
categories I now see there are four. The end 
units of two of my cards in one of my classes is 
different." He indicates Set C consisting of ZPPN, 
ZPZN, ZPPPNZP, and ZPPPPNZNNNNNP. 
"In fact, there are considerably more categories 
than I thought. My first category (Set A) is all 
right. The beginnings and ends are the same, and 
the intermediates are the same, Similarly, my second 
category (Set B) is all right. Tust this one I was 
trying to make my third category seems to be 
wrong. It does not seem to follow the same pattern 
the other two do. I just assumed it did, but I see 
that it doesn't. I could attempt to follow the same 
classification, that is, instead of having three cate- 
gories, | could have five or six. In other words, I 
will continue to see if a pattern emerges. It looks 
like I will continue with this new category. It 
appears, though, that I may have to discard my 
old standards, or criteria." 

The subject has been given no hint as to how 
many categories there should be; concern with 
the number is his own attempt to define the terms 
of the task. The desire to have as few categories 
as the evidence will allow seems to be an important 
consideration governing many subjects' satisfaction 
with their own efforts. Moreover, like most sub- 
jects, he wants to use the same rules for every 
category; it feels inconsistent, or at least untidy, 
to have a different description for each different 
set of messages. 

Note here, incidentally, that it is his third 
category with a double variation in the middle that 
is causing his troubles. 

At this stage the subject has received 12 cards 
and has arranged them as follows (though not 
necessarily in the same order within classes) : 

A. ZPZZPN 

ZPZZZZZZPN 
ZPZZZZZZZZZZZZZPN ` 
ZPZZZZZZZZZZZZZZPN 


В. ZPNPNPNPNPZN 
ZPNPNPNPNPNPNPZN 


С, ZPEN 
ZPPPNZP 
D. ZPNZNP 
ZPPPPNZNNNNNP 
Two cards, ZPZN and ZPNPZN, are left as 
odd cards, not yet assigned to any definite category. 
t card is ZPNPNPNPZN. The sub- 
ject says, "I just noticed something т hadn't 
noticed before: that all the cards begin with 
ZIR." Has he rediscovered something he remarked 
on first after the fifth card? Or is he noting that 
his earlier generalization is still valid? He says, 
“There no longer seems to be the tendency to 
have the same end units." Perhaps he means that 
the beginnings of the messages are still predictable, 
even though the endings are beginning to seem 
more variable. Unfortunately, it is not always 


The nex 
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possible to understand what a subject means; to 
interrupt him by questions would interfere 100 
much with the processes the experimenter is trying 
to observe. 

The next card is ZPPNZP. “I noticed that the 
card I just received is like the first card in one 
of my categories, the simplest card, only this 
additional symbols at the end.” He places ZPP 
together with ZPPN and ZPPPNZP. “This could 
not occur with any of my other cards,” he says. 
"I will continue." Не means that the endings of 
two of his cards in a category could be different. 

The next card is ZPZZZZZZZZZZNP. “Now 
this card fits into one of my earlier categories. I 
will put it there.” 

The next card is ZNZNNNNP. There is a 
long pause. Then, “Now I am lost.” 

“Why are you lost?” 

"I don't see relationships occurring between these 
last five or six or seven or eight cards. 1 cannot 
see where 1 can fit them sensibly into the cate- 
gories. They don't seem to be fitting into the 
same nice pattern that the earlier cards did." He 
leaves this card unassigned to any of the groups. 

The next card is ZPNPNPNPNPNPNPN- 
PNPZN. "This one fits into one of my earlier 
categories. What I would like to see... I would 
like to get more information in order to decide 
whether or not I have to abandon my older classi- 
fication. Maybe these cards that I have will fit 
better if a card could be used to establish a new 
category using the same criteria as before. It is 
not apparent, however, whether this will be useful 
in sorting these materiali. We have sort of new 
materials here. The two categories that [ have 
seem to be filling in very nicely; they seem to be 


complete series, whereas these other cards don't 
fit anywhere, yet." 


The cards that аг 


€ giving our subject so much 
trouble all belong together; the confusion is 
arising because there are two variables in this 


set аз opposed to only one variable in each of 
Sets A and B. 


While the subject was 
another card, ZPZZZPN, 
of his earlier categories, 

The next card is ZPZZZZZPN. He greets it 
like an old friend and disposes of it quickly. "Now 
this one, too, fits into my older category, s 
place it there." 

The next card is ZPNPNPNPNPNPNPNPZN, 
another old friend. "And, similarly, this one fits in 
all right." 

The next card is ZPPPPPPPPPPNZNNNPD. 
A stranger, alas. "Now, I have this card, „1 
see another pattern emerging, I think. here 
аге... now, let me think . . . yes, it's just that 
І didn't discover what the intermediate unit was 

in some of these, assuming it to be two Syllables, 
like the more obvious one in the Other set," 
says, comparing this card with the ons just 
preceding it. “I see now that the end Syllable 
consists of just one—I mean, the ending consists 


talking he reached for 
which he placed in one 


o I will 


he 


of just one syllable, and the intermediate ones are 
more than one syllable. I can form another clas- 
sification, using the beginning three symbols and 
ending in one symbol, at least for some of my 
cards. I can fit them into my classification. In 
other words," he says, revealing once more his 
desire to find a consistent basis for the classifica- 
tion, “I have formed a third category which is 
similar to the other two larger categories I have 
(meaning Sets A and B). And I still have Íour 
cards left over." He refers to. ZPPN, ZPZN, 
ZPNPZN, and ZNZNNNNP as the four that arc 
leit over; then, returns to his new, or third, 
category in search of a simple rule: "Now, let's 
sce И we can see the same sort of relationship 
here. Ah, it seems to be emerging. There is an 
end unit which is constant and there seems to be 
а middle unit which seems to be constant. So we 
can have two ends and a middle; then, in between 
ends and the middle, there are other units which 
are the same. The same unit is added, only just in 
different numbers. So we have the same sort of 
process as before, except a little more complex. 
Now the question is, how to classify these cards. 
We will make a category out of all the cards that 
have the same two end units and the middle unit 
and I will classify them in order of their relation- 
ships depending upon the number of units in the 
beginning, and then the number between the end 
and the middle. If there is one unit, it will be the 
first card; if there are three units, it will be 
the second card, assuming that there isn’t one 
with two units in between. And if I have two 
cards with the same number of units repeating in 
between the beginning and the middle units, then 
I will use the second side—1 mean, the number of 
syllables that is between the middle and the end is 
my second way of classification. I will rearrange 
these now,” 

Notice how the subject 
variant and invariant units in the statement of his 
rules. The approach might now be characterized 
as Initial-x-Middle-y-Final, Where x and y герге- 
sent the variable parts. And he also has the rule 
well enough in mind to use it for ordering the 
cards within each category. 


distinguishes between 


"When I come to rearrange these," 
says, "I see this isn't the case.” 


“What isn't the case?" 


“That the two ends and a middle unit are con- 
stant with all the cards, and there js an extra 
syllable. So, I am going to wait.” 

At this stage the sub 
and has rearrange 
necessarily in the 

A. ZPZZPN 

ZPZZZPN 
ZPZZZZZPN 
ZPZZZZZZPN 
ZPZZZZZZZZZZPN 
ZPZZZZZZZZZZZZLZPN 
ZPZZZ22722222222PN 


the subject 


ject has received 21 cards 
ed them as follows (though not 
same order within classes) : 
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B. ZPNPNPNPZN 
ZPNPNPNPNPZN 
ZPNPNPNPNPNPNPZN 
ZPNPNPNPNPNPNPNPZN 
ZPNPNPNPNPNPNPNPNPZN 

C. ZPNZNP 
ZPPNZP 
ZPPNNPP 


ZPPPPNZNNNNNP 

ZPPPPPPPPPPNZNNNP 
Four cards, ZPPN, ZPZN ZPNPZN, and 
ZNZNNNNP, remain unclassified. Although he 
is making progress, he docs not feel completely 
confident about the third category. [At this point 
the reader should try, if he has not already, to 
fit the four odd cards into the existing categories ; 
he will discover that this subject, like the majority 
of subjects, finds it difficult to invent zero when 
he comes to count the number of times a unit 
recurs. ] 

The next card is ZPZZZZ ZZPN. "This one 
lits into one of my older classi cations, and in my 
first big category." 

"The next card is ZPZZZZZZZZZPN. "And this 
one, too. I will put it there." 

The next card is ZPNZNNP. “Now I notice 
these . . . here . . . no, ГИ see if it fits into one 
of my categories and if not, I will put it with these 
other cards and see if I can establish a new 
category with them. Now, this last card seems 


to be similar to two of my cards that I had set 
together with 


aside." He puts ZPNZNNP it 
ZPPPPNZNNNNNP and ZPPPPPPPPPPNZ- 


NNNP. "They have the same beginning and the 
same end and pretty much the same middle unit, 
but not quite, however." 

The next card is ZPNPNPNPNPNPZN. “This 
fits into one of my older categories.” 

The next card is ZPZN: it is the same as the 
third card he had received earlier [pause]. “This 
one I will place with that and that,” he indicates 
ZPNZNP and ZPNZNNP. "There's . . . fa 


Pause] it just occurred to me that, although pe 
"ppears to be a relationship, it 15 probably only 
at there are only four 


by virtue of the fact th : 1 
symbols we are using.” The mistake persists; he 
still does not realize that, in fact, only „тес 
Symbols аге used. "However, he continues, "there 
does appear that there is an order. I mean, if 

were to divide it like I have two end units and 
middle units, it appears that there 15 anotar 
Syllable appearing in between the beginning ant 


the middle and it is always the БИП К. 
Just varies i t h like the other classi 
i aber, much lik 
Койсоң As a matter 


cation. So now I will order those. ага 
Of fact, I see that three of ту cards fit very nicely. 
"Ctually, there are four that fit in very nicely. 
"We 2, oh, no, this is a little different. „Тһе 
Спа unit is different, so I have "ow established 
‘he fourth category. And I still have two pris 
eft over that Lam not quite sure what to do with. 
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Let me check now and see if they don’t fit 
somewhere . . . no, they don't seem to fit." 

The. next card is ZPNPNPNPNPNPNPN- 
PNPNPZN. “This fits in very nicely with one 
of my earlier categories.” He disposes of it 
quickly. 

The next card is ZNZP. 
seem to fit anywhere.” 

The next card is ZNZNP. He pairs it with 
ZNZNNNNP, saying, “Now I have two odd 
cards that seem to fit to form another category, 
using the same idea of picking out syllables inter- 
spaced in between, just repeating. There is just 
one thing I want to say pertaining to one unit. 


“This card does not 


cards which, temporarily, I will call separate 
categories.” 

The next card is ZPPPPPNZP. The experi- 
menter says, "This is your last card." 

The subject says, "This one, I think, fits into 
one of my earlier categories. Now I want to 
look over these couple of categories and see how 
| stand on those. I want to make sure that I have 
really followed the rules I think I established.” 

“You can take all the time you want to," says 
the experimenter, "but don't forget to talk." 

At this stage the subject has received 30 cards 
ranged them as follows (though not 
n this order within classes) : 


and has 
necessarily 
A. ZPPN 
ZPZZPN 
ZPZZZPN 
ZPZZZZZPN 
ZPZZZZZZPN 
ZPZZZZZZZPN 
ZPZZZZZZZZZPN 
ZPZZZZZZZZZZPN 
ZPZZZZZZZZZZZZZPN 
ZPZZZZZZZZZZZZZZPN 


B. ZPZN 
ZPNPZN 
ZPNPNPNPZN 
ZPNPNPNPNPZN 
ZPNPNPNPNPNPZN 
ZPNPNPNPNPNPNPZN 
ZPNPNPNPNPNPNPNPZN 
7PNPNPNPNPNPNPNPNPZN 
7PNPNPNPNPNPNPNPNPNPZN 


С. ZPNZNP 
ZPNZNNP 
ZPPPPNZNNNNNP | 
ZPPPPPPPPPPNZNNNP 
D. ZPPNZP 
ZPPPNZP 
ZPPPPPZP 
E. ZNZNP 
ZNZNNNNP | 
Two cards, ZNZP and ZPZN, remain unclassified. 
“Okay.” the subject says. “Well, ГИ start out 
with mv biggest and main categories because the 
' so I'll make sure that they 


relationships are clear. 
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follow all the way through. Now, they all have 
Ше same beginning and end syllables. There is no 
question about that. And, similarly, the second 
and next-to-last symbol in all—the beginning and 
end units are similar in all cards. In my first 
category the intermediate symbols are always ZIRs 
and they occur in varying numbers. Let's see, 
one, two, three, skip some; I can order these 
according to the number of ZIRs in between. I 
don't suppose it matters as long as I am classify. 
ing rather than trying to interpret the information. 
I can't seem to decide between these: they scem 
to be homogeneous." 

He shifts to Set B. “PIL take the next one. 
Again, we have two syllables that are the same— 
I mean, they are the same from card to card, 
beginnings and ends. And we have in between 
them, rather than one syllable repeating, a pair of 
syllables repeating . . . and always ending in the 
same one. So I order these according to the 
number of insertions." ' 

He shifts to the Set С. "Now I am going to 
look at one of the more complicated categories. 
Here it seems there's a beginning unit that consists 
of two syllables on each of the two cards and the 
end unit which consists of only one Syllable" [short 
pause]. "In addition, there is a middle unit that 
consists of. NEN ZIR, or whatever it is. And 
in between there are varying numbers of syllables 
between the beginning units and the middle unit. 
In this category there are varying numbers of 
syllables PAG, so I am going to arrange them in 
this order. This will be my primary ordering 
process. And, should there be duplication in the 
number of РАС», then I would check the number 
of symbols on the other side of the middle part, 
and order them according to their number" [pause]. 

“I see that this does occur . . - 50, they are ordered 
first with Tespect to the symbols between the 
beginning and the middle, and then with respect 
to the symbols between the middle and the end, 
Should there be any ambiguity in the first order 

. it appears like a very пісе...” He combines 
his next three categories into one on the basis 
of the rule just Stated; the result is а third 
category : 

C. ZNZNP 

ZNZNNNNP 

ZPNZNP 

ZPNZNNP 

ZPPNZP 

ZPPPNZP 

ZPPPPNZNNNNNP 

ZPPPPPNZP 

ZPPPPPPPPPPNZNNNP 
"Now," he continues, "I have the cards in three 
main categories, and ГИ see if they will hold up. 
Again, they begin with, and end wi 
unit; and the same pattern of mi 
repeats." 


th, the same 
ddle syllables 


He shifts to the remaining odd cards, ZNZP 
and ZPZN. "Now let's see if my 


last category 
I'm not sure what to do wit is 


В that—make 


sure if it doesn't fit into one of the old ones vet 
it doesn't as such . . .” [long pause]. I'm glancing 
over all the categories now to make sure that this 
last category doesn't fit into a subset of those" 
[pauses while looking]. "I think. . . ." He picks 
up ZPZN. "I now notice that one of these cards 
does fit in to the, ah, my earlier category. ... 
Its curious, I didn't notice it before, but it seems 
to be identical with the first one in one of my 
categories. Well, ГИ place it there." 

There is a long pause while he considers the 
remaining odd card, ZNZP. “It seems a shame 
10 have a one-card category. ГИ look over once 
moge to see if we can find а place to put it... 
this odd card." 


After a little more search he gives up. "I will 
leave it as it is," he says, "I am happy with my 
classification." 

The above technique was abandoned in 
all the studies to be described below in 
favor of a simpler and quicker one: all the 
cards were simultaneously presented to the 
subject and no running protocol was re- 
corded as the subject sorted the strings 
Comments the subject wished to make on 
his method of sorting were recorded only 
at the end of the experimental session. 


Tue PROBLEM 


This account of a subject's responses to 
à problem solving situation has been recon- 
structed from a tape recording. The sub- 
ject is confronted with the problem of 
processing information which he received 
а little at a time. We can think of the sub- 
Ject as starting the task in total darkness, 
for he has almost no idea what kind of in- 
formation is encoded in the messages, What- 
ever he does with the messages is entirely 
a matter of his own decisions, 


After watching a dozen or more subjects 
perform in a similar manner, one 
pressed with several things. 


the similarity between this lab 
sode and ordinary 


is im- 
The first is 
Oratory epi- 
s 1 behavior shown in deal- 
ing with our daily environment. АП day 
long stimuli impinge upon an organism in 
а haphazard array: unless he can rapidly 
sort and file information in the right com- 
partments his life will quickly revert to 
chaos—to the “buzzing, blooming con- 
fusion" of the newborn child. Man orders 
his environment according to rules that he 
develops for himself or which he learns 
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from others; by this recoding technique he 
can economically handle the multiplicity 
and the variability of his environment for 
present and future use. 

А second impression one gathers is that 
men have learned to behave in this orderly 
manner without any specific instructions and 
without any knowledge of results provided 
by a teacher or an experimenter. Man 
seems to respect structure in the world 
wherever he finds it, regardless of сопѕе- 
quences. It does not seem necessary here 
to invoke any hedonic contingencies—un- 
less perhaps orderliness carries its own 
rewards: a satisfaction with the goodness 
of fit. The subject is uncasy when a string 
of symbols does not fit one of the categories 
he has evolved. What one means by the fit 
here is more subtle, and the lack of fit less 
obvious, than in a mechanical or a picture 
puzzle; without any outside guidance or 
control the subject moves from a search 
for isolated perceptual relations into а 
search for the conceptual pattern under- 
lying the stimuli that he receives. 

The behavior of our subject emboldens 
us to argue for an implicit conception of 
structure in man. The argument 1s, of 
course, not new to psychology. It was first 
brought into lively focus by the Gestalt 
psychologists as part of their protest 
against the Wundtian tradition of elemen- 
tism and against the behavioristic denuncia- 
tion of everything mentalistic. The Gestalt 
Psychologists did not share with Behavi- 
Orism its contempt for mentalistic concepts, 
пог its distaste for introspection. On the 
Contrary, they were against any oversimpli- 
fied mechanistic view of man; they urged 
the importance of value in a world of facts 
Kóhler, 1938). Meaningful relations 
(Kohler, 1929; Wertheimer, 1945), organ- 
ization (Köhler, 1929), fitness of parts into 
an organic whole ( Koffka, 1935), required- 
Ness (Köhler, 1938)—these were some 0 
the values they wished to restore to а world 
acked up into elements. | | 
he emphasis on relations is not а ше 
(ought, either, Even Thomas Brown 
nied in Woodworth, 1948) early їп б 
( песет century, and von Ehrenfels 

: ed 
later in that century, had stress 


the importance of relations. But the notable 
contribution of the Gestalt school was to 
demonstrate the crucial role of structure, 
or pattern, both in the stimulus configura- 
tion and in the responses of the observer. 
According to them the perceptual process 
is essentially one of organization. The brain 
is a dynamic neural center where perceptual 
objects combine or separate on the basis 
of similarity, contiguity, symmetry, com- 
mon fate. Köhler’s conclusion (1947) that 
an "organism responds to the pattern of 
stimuli to which it is exposed" (р. 103) 
seems to fit the illustrative account of our 
subject's responses to a decision making 
task. We cannot help but notice how often 
the subject is concerned with the “fit” of 
the messages into some category he had 
established: he is continually seeking “re- 
lationships," he talks about "pattern," he 
is trying to fit the parts into a total scheme 
he has set up to solve his problem. In fact, 
Wertheimer (1945) might have been de- 
scribing our own subject instead of the 
children who solved geometry problems, 
when he wrote: "In all these cases we have 
structural changes, tendencies toward the 
better structure, toward fitting, with the 
disappearance of disturbances" (p. 57). 
Since the notion of structure, or pattern, 
as proposed by the Gestalt psychologists, 
seems essential for the description of our 
subject's behavior, it becomes the starting 
point of our inquiry. The notion of “pat- 
tern,” however, as we use it here, is some- 
thing of a special case of the Gestalt notion. 
We are interested in what Miller and 
Chomsky (1958) have called “pattern con- 
ception.” For the purpose of the present 
experiments we define pattern as the invari- 
ance of a sequence of symbols in the con- 
text of lawful change, and we simplify its 
meaning by defining pattern in terms: of 
internalized rules, ог relations, a subject 
employs to organize novel information. 
Rules usually imply a notion of abstraction 
and generalization ; they go beyond the in- 
formation given. They cannot be said to 
be covered either by the transposition ех- 
periments such as those of Kohler (cited 
in Osgood, 1956, p. 495), nor by the studies 
of stimulus generalization conducted by the 


со 


Behaviorists (Hovland, 1937a, 1937; 
Razran, 1940, 1949). In both cases the ге- 
sults could be explained on the basis of 
sensory relations alone. Ву the use of rules 
the subject searches for pattern in order to 
recode the sensory input into manipulable, 
meaningful units. Perhaps a limitation on 
the information capacity of the human 
channel makes such behavior (recoding) 
necessary. Ín any case, it is our wish to 
find out something about these rules or pat- 
terns by which the subject handles informa- 
tion. We would like, therefore, to make a 
Systematic study of some of the experimen- 
tal variables that affect the nature of the 
rules that subjects employ in order to 
search for pattern in the stimulus input. 
One could begin a study of rules by ask- 
ing the usual questions of What? How? 
and Why? Hull (1920) is one among those 
researchers who tackled the question What? 
"What does a child learn," he asked, 
"when he calls a certain object dog?" This 
question is part of a broader question: 
What does any individual learn when he is 
said to have learned a concept? To find an 
answer Hull required adult human sub- 
jects to learn some Chinese sounds paired 
With certain characters in the Chinese 
alphabet. Hull’s conclusion was that what 
an individual learns is a rule about the 
common characteristics of a class of ob- 
jects. He placed the emphasis on the com- 
mon characteristics of the stimuli: the rule, 
Hull equated with discrimination learning. 
It is, therefore, not clear to the reader 
whether Hull's subjects learn a rule or 
whether they merely learn a name for the 
rule. 
А second answer to the question What? 
is contained in the word relations. Accord- 
ing to a second view what the subject really 
learns is some kind of a rule of relations by 
which he groups instances as members of à 
class. А search for common relations among 
instances might take the subject beyond 
the common perceptual elements to some- 
thing more. Гог the Gestalt thinkers this 
something more is the "organic whole" Which 
is more than the sum of its parts. Goldstein 
and Scheerer (1941) describe it as “aly. 


straction.” According to them the process 
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of abstration is different from that of ag- 
gregation of common elements under one 
label. Smoke (1932) discusses this same 
ilea as symbolic reaction. Our own ap- 
proach to rules is in terms of relations and 
we view rules of relations as pattern con- 
ception. 

The question—how rules are learned or 
made—has been ingeniously handled by 
Bruner, Goodnow, and Austin (1956), and 
more recently by David Reiss (1958). 
3runer et al. systematically investigated. the 
methods by which subjects categorize given 
events. The categories into which events 
are to be placed are predetermined by the 
experimenter's rules. It is the task of the 
subject to discover the experimenter's rules 
for placing the given events into their 
proper boxes. [n order to categorize suc- 
cessfully the subject uses "strategies," or 
plans, by which he searches for informa- 
tion which would be useful to him in con- 
structing his hypotheses about how the 
events are distributed into the experimen- 
ters boxes. This stepwise process—of 
Strategy construction, searching for infor- 
mation, hypothesis building, testing hypoth- 
eses—the authors describe as the process 
of concept attainment. Of these steps, the 
one of strategy construction is the most 
critical. Bruner et al. (1956) describe 
Strategy as “regularities in decision- 
making,” and define it as “a pattern of de- 
cisions in the acquisition, retention, and 
utilization of information that serves cer- 
tain objectives (р. 54). A “host of factors" 
influence the construction of Strategies in 
concept attainment, according to these re- 
searchers. 


Tn his discussion of the construction of 
strategies by subjects, Reiss takes Druner's 
"host of factors" for granted, and he pro- 
ceeds to study how a subject constructs his 
"model" of what the rules that govern the 
stimuli might be, but Reiss studies this 
without constraining the subject strictly by 
the experimenter’s rules. He varies the 
quantity of evidence given to the subject 
initially, and then observes how this amount 
of information affects the subject's be- 
havior in model attainment, Reiss finds 
that the subject's model is in fact seriously 
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influenced by the amount of information 
initially presented to the subject. 

In all the studies discussed above, except 
the last one, the experiment concludes when 
the subject reaches the solution as the ex- 
perimeter stipulates it. Hull's subjects 
continue to learn the list until the criterion 
is reached. The criterion, of course, means 
the one that the experimenter has set up. 
Hanfmann and Kasanin (1937) turn over 
а block or two to suggest to their subject 
the "correct" grouping which he must dis- 
cover. Bruner, Goodnow, and Austin con- 
firm and infirm hypotheses. In none of 
these studies is the subject ultimately free 
to set up a criterion independently of the 
experimenter. Obviously then, the question 
arises whether the subject will proceed to 
work by rules even when he is free from 
the experimenter's criterional constraints. 

Thorndike (1931) asked subjects to 
draw lines equal to a specified length with- 
out giving them any information about the 
accuracy of their results. Yet one notes in 
his results that the performance of sub- 
jects began to stabilize around some stand- 
ard implicit in the subjects’ behavior. In 
other words, the subject seems to be 
settling оп some rule which he bases upon 
his own performance. It is immaterial 
whether he does this consciously or uncon- 
sciously. 

One notices a similar reliance upon a 


subjective standard in Reiss’ subjects as 
« i 

they construct models. The random 
onfirmed 


Sroup, whose hypotheses were с 
or infirmed according to a prearranged ran- 
dom schedule, did not perform randomly 
at all. Instead, subjects constructed models 
quite early and began to test these on the 
basis of their own hypotheses, disregarding 
the information they received from the ex- 
Derimenter. 

Systematic responses, к 
abruptly shifted to new modes of syste- 
Matic behavior have been observed Бу 
Krechevsky (1932), Witkin (1940), and 
SPence (1939) in animal subjects. By 
Means of a four-point discrimination L4 
Paratus, Krechevsky presented an Wr 
Soluble” problem to his rats. These sub- 


which may be 


jects did not run around the apparatus in 
random fashion, however; they developed 
systematic modes of responses. Later, Wit- 
kin, substantiated these results on animals 
run in nonspecific problem situations, but 
his explanation was different. Spence ob- 
served the same phenomenon in chimpan- 
zees when he presented them with an 
insoluble problem in a multiple-choice ap- 
paratus. 

We have mentioned these studies for 
three reasons: first, because all of them 
employ a notion of a criterion of success 
which is externally determined; second, 
that this notion of an externally imposed 
criterion introduces some contingencies of 
reinforcement for the subject; and third, 
that when the contingencies of reinforce- 
ment occur haphazardly, the subject disre- 
gards these and begins to develop some 
stable modes of responses on the basis of 
his own performance. 

Al of these observations influence the 
design of our research ; we deliberately take 
our point of departure from the first two 
and confine our interest to the third. We 
stated earlier that our interest is in rules. 
What kinds of rules does a subject make? 
How are his rules different from the ex- 
perimenter's rules? Do these rules depend 
upon the nature of the stimuli, upon a 
knowledge of results, or upon the subject 
himself, or on a combination of these fac- 
tors? 

These and a host of other questions 
clamor for recognition and investigation. 
Twentieth century psychology has not been 
indifferent to this need. We have already 
seen how some of the earlier studies, 
characterized as dealing with concept for- 
mation, were investigating rules of induc- 
tion. Within the same framework, but one 
step further, Heidbreder (1946) introduced 
the idea of hierarchy in rules. The notion 
of hierarchy of rules implies the notion of 
complexity of strategies that subjects con- 
struct, In later studies the researchers were 
interested in the effect of rules upon learn- 
ing and recall. For example, Miller and 
Selfridge (1950) tested their subjects on 
memory of passages which were graded ac- 
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cording to their statistical approximations to 
English from zero to high order. The bet- 
ter recall of meaningful materials, the au- 
thors explained on the basis of the sub- 
ject's familiarity with the rules that 
governed the sequential properties of the 
meaningful materials, not the fact of mean- 
ing itself. Е 

Miller deals with the facilitatiing effect 
of rules of formation upon learning and 
recall in a later study (1958) in which he 
compares the {тее recall of redundant and 
random strings. Again he finds that even 
when the subject is unaware of the rules of 
formation that govern the structure of re- 
dundant strings he stil does better in 
memorizing the redundant strings. Prior 
to this study of Miller's, some experiments 
had been done by Aborn and Rubenstein 
(1952; Rubenstein & Aborn, 1954) to study 
the effect of rules of contextual constraints 
upon learning sequences of nonsense syl- 
lables. Test passages were developed by 
progressively limiting the context in which 
the nonsense syllables could appear. Thus 
the rules ranged from purely random to 
highly redundant. These rescarchers found 
that when subjects were made familiar with 
the rules of formation, redundant se- 
quences were somewhat easier to learn, but 
the amount of information contained in the 

recalled materials was not constant from 
passage to passage. 

So far the rules we have referred to 
have been simple rules ; they pertain to two 
kinds of stimuli—unit stimuli and sequen- 
tial stimuli—both of which have been gen- 
erated with some kinds of rules of forma- 
tion. Language is a good example of the 
second type of stimuli. Miller's rules, as 
well as those of Aborn and Rubenstein, and 
of Reiss, deal with rules of this type, that 
is, with rules of formation that govern 
sequential dependencies. 

Rules that govern sequential properties 
of a language also have important effects 
upon its statistical properties ( Miller, 
1951). Shannon (1949) analyzes the se- 
quential properties of language by proba- 
bility rules, a technique that provides access 

to information measurements as well as to 
organization. His rules deal with the selec- 


tion of a message тот а determinable 
number of alternative messages that could 
occur, and with the probability of its occur- 
rence. His rules deal with the amount of 
information contained in a message. Since 
Shannon's concern is primarily with the 
communication of a message and not with 
its use, value, storage, meaning, history, ог 
anything else, his rules are not of direct in- 
terest to us in the present context. His 
terms, however, and his techniques for 
measuring the amount of information 
transmitted through a noisy channel pro- 
vide the psychologist with some useful 
tools of analysis which information theor- 
ists ( Attneave, 1959; Hovland, 1952; Luce, 
1954; Miller, 1953; Quastler, 1955) have 
applied for measuring organization. We, 
too, shall draw upon these aids. 


In contrast to Shannon, the communica- 
tion engineer, we can Расе Chomsky 
(1957), who looks at language as a gram- 
marian. His rules are more general and 
more formal than those of Aborn and 
Rubenstein, though not more difficult; he 
refers to his rules as "finite state gram- 
mars." These simpler grammars enable us 
to avoid the complications of natural lan- 
guages and to work with simpler structures. 
But although his notion of rules is formally 
simple, it raises а few complex questions 
regarding relation between rules and per- 
formance. Miller (1958) points to this 
problem and he would "like to know what 
changes, if any, in a set of rules leave per- 
formance invariant,” (p. 485). And so 
would we, 

Simple rules of formation of sequential 
stimuli do not always produce simple effects 
upon responses. Some rules facilitate per- 
formance (Anderson & Leonard, 1958; 
Fitts, Weinstein, Rappaport, Anders 
Leonard, 1956; Miller, 
rules 


on, & 
1951) and some 
hinder it (Anderson & Leonard, 
1958). It seems necessary, therefore, to 
search for parameters that affect the con- 
struction and use of rules of formation. 
It is the purpose of this study to explore 
some specific experimental variables im- 
plicit in these rules that influence the ра“ 
terning responses of subjects. 
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The role of specific stimulus features in 
pattern perception has been previously 
studied by the Gestalt group, by Attneave 
(1954, 1955), by Fitts and his colleagues 
(1956), and by others (French, 1953, 1954b, 
1954c). Gestalt psychologists analyzed the 
peripheral properties in visual organization 
as proximity, similarity, continuity, and 
closed figure, but the process of organiza- 
tion itself could not, according to them, be 
understood without the field concept. They 
postulate an isomorphism between visual 
organization and the brain processes. Ac- 
cording to the principle of isomorphism the 
environmental field tends to become “рзу- 
chologically organized” in terms of regular- 
ity, symmetry, and simplicity. Attneave 
translated and quantified the Gestalt fac- 
tors of organization in terms of informa- 
tion measure of redundancy in the visual 
input. He measured memory for simple dot 
patterns on recall. He found that good 
form, redundancy, and noise were some of 
the experimentally accessible variables that 
affect pattern perception. Fitts et al. at 
Ohio State University have done special 
work with redundant forms in visual pat- 
tern recognition. They developed metric 
figures of random and constrained type and 
they measured the amount of redundancy 
contained in these figures. They studied 
the correlation of a measured amount ot 
redundancy with recognition of visual 
forms. Some additional variables, such as 
language habits and context (Miller, 1951). 
also have been found to affect the proba- 
bility of response. Carmichael, Hogan, and 
Walter (1932), for instance, demonstrated 
the influence of instruction upon response. 
Although we shall give a different inter- 
Pretation of their results later, we shall not 
Ignore the importance of instruction as а 
Dossible source of variation in pertormance. 

All the studies mentioned above suggest 
that a quantitative analysis of the experi- 
Mental variables in pattern conception 15 
Possible, These studies also help to separate 
Some parameters of pattern conception, j^ 
ticularly those concerned with the ens 
Of different alternatives used and with 
Structural similarity among the ut 

he number of different alternatives has 


proved to be an interesting parameter in 
studies in perception (Attneave, 1954) and 
in memory (Brogden & Schmidt, 1954a, 
1954b). Whether it is an important vari- 
able in pattern conception as well is the 
question we raise. Does a larger vocabulary 
help with the better ordering of the se- 
quences of symbols or not? If not, why 
not? One might expect the facilitatory 
effect of the larger number of different 
alternatives to reach an asymptotic value 
at some point. 

Structural similarity is another para- 
meter that must be explored. Its impor- 
tance in visual pattern recognition (French, 
1954a, 1954b), and in learning and re- 
call (Aborn & Rubenstein, 1952; Miller, 
1958; Rubenstein & Aborn, 1954) has 
been demonstrated. Does it also affect pat- 
tern conception? The notion of structural 
similarity as used in this study should not 
be confused with Shannon's notion of re- 
dundancy. The languages we are working 
with are, in a sense, "degenerate" because 
in the limit their redundancy is 100%. We 
define structural similarity as similarity in 
rules of formation and specify it in terms 
of the number of basic strings as well as 
the number and placement of recursive 
loops within each string. 


Materials and Procedure 


Miller and Chomsky (1958) have pointed 
up some of the difficulties of method and 
material in laboratory studies of concept 
formation. In the earlier concept forma- 
tion studies the subject was constrained by 
the nature of the task to find the rules 
as the experimenter defined them. Further- 
more, the use of geometrical patterns differ- 
ing in size, shape, and color effected ja 
certain stereotypy in material and usual y 
limited them to a finite number of possible 
alternatives. from which the positive 107 
stances were to be selected. Miller and 
Chomsky proposed the use of a new kind 
of material in the study of pattern con- 
ception, viewing it as а different kind of 
task: instead of spatial patterns, they pro- 
posed to study sequential patterns. By using 
finite state systems for materials one could 


12 
study pattern conception in a situation 
“modelled after the task of a grammarian.’ 
A finite state language is an artificial 
language consisting of the set of well- 
formed strings (sentences) that can be 
generated by a set of simple rules (see 
Chomsky & Miller, 1958). This set of rules 
is a kind of algebra, or grammar, for iden- 
tifying the grammatical sentences in a par- 
ticular language. A finite number of sym- 
bols makes up the vocabulary of a language. 
A finite state grammar consists of a finite 
number of states (So, S,, 5,,...) and rules 
for moving from one state to another. A 
state is defined as the set of all rules appli- 
cable to the state. Starting with any given 
state 0 there is a transition to a next state 
i via a symbol Thus, for example, 
So => S; which сап be written (0, a, i); 
this is one rule of transition of the gram- 
mar. A rule of a grammar determines 
whether there is a direct transition from 
the present state to one or more other 
states, or to the same state itself in the 
form of a recursive loop. Figure 1 is a 
graphical representation of the finite state 
generator used in the introductory Illus- 
trative Study. The numbered circles repre- 
sent the states of the system and the arrows 
represent the rules of transition permitted 
from one state to the next. From Sa, for 
example, the grammar permits us to move 
to S; then back again to S, thus: 
Sy pas d This loop can be traversed any 
number of times. The nonsense words аз- 
sociated with the arrows represent the 
finite set of symbols that comprise the given 
vocabulary of the language. This par- 
ticular finite state system has a vocabulary 
of three words—NEN, PAG, and ZIR, 


? A recursive loop in a string is the unit that 
can occur in the string from zero to any number 
of times. Expressed graphically, а loop in the fol- 


a 
lowing string So—>S,—> sty => S, occurs at $, 
а b c 


and is indicated by the looping arrow 
string could have any number of loops at 
the states. Each of the gramm 
III and IV contains 
strings. 


а А 
‹ any of 
ars in Experiments 
а double loop in one of its 
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which could be abbreviated as №, Г’, and 2 
for convenience. One of these words is 
generated each time we move from one 
state to the next. For example, in 5; the 
rules say either move to S, by generating 
the word PAG, or make a recursive loop 
by returning to the same state itself by 
generating ZIR апу number of times. 
Upon the first visit to Sẹ a sentence 15 


completed; a sentence is any well-formed 
string. A well-formed string is generated 
by always starting in S, and following any 
route in the diagram and ending in the first 
strings 


visit to Se. The well-formed 


RULES OF TRANSITION 
Е, 


(0,24) (,P,3) (3,2,3) (5,P,2) (7,2,8) 
(0,2,6) (2,N,5) (3,P,4) (6,P,6) (8,N,8) 
(1,P,2) (2,2,4) (4,N,0) (6,N,7) (8,P,0) 


BASIC STRINGS CANONICAL NOTATION 
SINS) ———CONVUAUÓON 


ZPPN ZP 
ZPZN 
ZNZP 


(Z) PN 
ZP (NP) ZN 
Z(P )NZ(N)P 


Fic. 1. A graphical representation of the finite 
State generator of sentences used in the Illustrative 
Study. (An admissible string is any sequence of 
words generated by starting in state 0 and ending 
on the first visit to 0’, All admissible strings can 
be expressed by means of the canonical notation 
given below the figure, Loops have been omitted 
in the basic strings. The rules of transition given 
beneath the diagram comprise an equivalent de- 
scription of the generators. Each rule defines the 
state in which it is applicable, the symbol it gener- 
ates, and the next state in which the rule leave 
the system.) 


SOME VARIABLES AFFECTING PATTERN CONCEPTION 13 


selected by the generator in Figure 1 are: 
ZPZN, ZPPN, ZNZP, ZPZPN, ZPNZP, 
ZNZNP, ZPNPZN, ZPNZNP, etc. Of 
these well-formed strings the first three we 
consider as basic strings. The important 
feature defining a basic string is that no 
state is visited more than once. When loops 
are inserted, states are revisited; basic 
strings with their loops we are naming 
formulas. Basic strings are significant be- 
cause, as Chomsky and Miller have shown, 
they provide the pattern for all the longer 
sentences in the languages; in grammatical 
terms, they are the linguistic “frames” into 
which the elements must fit. Any number 
of different sentences can be generated by 
following the rules and using the recur- 
sive loops any number of times; if we in- 
dicate loops by parentheses, however, all 
the sentences in the system shown in 
Figure 1 can be represented by a canonical 
notation of the general formulae : 
ZP(Z)PN 
ZP(NP)ZN 
Z(P)NZ(N)P 

At the bottom of Figure 1 are stated the 
rules of transition, the basic strings, and 
the canonical notation for the formulas. 
For example, (0, 7, D is a rule of tran- 
sition: it means that when the system is 
in S, it can generate the symbol Z and 
thus move to S,. One direct path traced 
by a set of rules of transition, without 
loops, is a basic string: with loops it isa 
formula. Thus (0, Z, 1), (1, P, 2), (2, Z, 
4), (4, N, 0/) yield the basic string 
ZPZN : by adding the following two rules, 
(2, М, 5) and (5, P, 2). а double loop has 
been added to the basic string which gives 
the formula ZP(NP)ZN. 

Finite state systems | att 
materials for experimental studies in psy- 
chology, Their formal simplicity and psy- 
chological complexity make them potentially 
Useful for the study of higher mental 
Processes. Moreover, the pattern and con- 
Struction of finite state systems lend them- 
Selves to quantitative handling by informa- 
tion measures, These materials are a new. 
и and quantitative tool Da; agri 

Чу of memory and thinking. Fer К 


provide attractive 


reasons we choose to use finite state sys- 
tems as materials for exploring pattern con- 
ception. From a finite sample of sentences, 
subjects are expected to infer the under- 
lying grammar. To what extent controlled 
variation in the formal structure of these 
systems may affect the behavior of the 
subject broadly defines the scope of the 
present research. 


А At the beginning of an experimental session a 
finite set of sentences from a particular language 
is presented to a subject in a rectangular array of 
cards on a table surface. The cards have been 
previously arranged by tables of random numbers 
and are all exposed simultaneously before the 
subject. (This procedure of presenting all the 
cards to the subject at once is different from the 
procedure followed in the Illustrative Study.) Each 
3" X 5" card contains one string of words only; 
the string is typed in capital letters on a single 
line. The subject sits facing the cards; he reads 
the typed instructions, then repeats them to the 
experimenter in his own words. 

Neutral instructions: "Here is an array of cards 
to be sorted into groups, into as many or as few 
groups as you wish to make and for whatever 
reason you wish to make them. When you are 
satisfied with your groups inform the experi- 
menter." 

Tf a subject has any questions regarding the 
time limit he is informed that there is none. Other 
questions are answered by merely referring him 
to parts of the typed instructions. When a subject 
begins to study the cards the experimenter starts 
a timer manually; the experimenter stops the 
timer as soon as the subject is through with his 
sorting. After the subject has made his groups he 
is asked to generate two sample strings to match 
each of his groups and the experimenter records 
the results of the subject's sortings along with 
any comments that the subject is willing to make 
about the way he performed the task. 

The following principles were implicit in the 
procedure: (a) the problem is presented in as 
unstructured a way as possible in order to give the 
subject freedom to structure the materials by his 
own rules, (b) the subject is free of time pres- 
sures, and (c) there is а three-way check (sorting, 
additional instances, and free comments) on 


whether or not the subject was using any con- 


sistent strategy in arranging the cards into cate- 


gories. 


Subjects 

Graduate and undergraduate students of Rad- 
clie College and Harvard University in the age 
group between 18 and 35 years, irrespective of 
feld of specialization, were rum as paid subjects. 
No subject served twice in any part of the experi- 


ment. 
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Aw EXPLORATORY STUDY 
(Experiment II) 


This experiment was a pilot study (а) to 
explore the kinds of strategies subjects изе 
in grouping a given set of strings generated 
by finite state grammars (if, indeed, they 
used any strategy at all), and (b) to de- 
fine some quantitative methods to measure 
the amount of "pattern" in each system of 
categories developed by a subject. 


Strategies 


Definition. А strategy may be qualitatively de- 
fined as the method or rule used by a subject for 
grouping a number of given strings into subsets. 
Operationally, a strategy is determined by the 
specific sentences that a subject puts into a subset. 
A specific strategy equates with a specific pattern 
a subject uses for making his groups. (This use 
of the term is different from the way Bruner 
defines it—Bruner сё al., 1956, p. 54). 

Quality. А strategy may be successful or less 
successful. It is considered successful if it cor- 
responds completely with the basic strings of the 
experimenter's grammar. [t is less successful if 
it deviates from the experimenter's rules. The 
degree of correspondence between the experi- 
menter’s rules and a subject’s rules determines the 
degree of the subject’s success. No strategy used 
by a subject is labeled as completely unsuccessful. 


Procedure 


The materials and procedure followed in this 
study have been described above; they are essen- 
tially the same in all the studies described below. 
In this exploratory study nine different finite state 
systems were set up in a 3 X 3 design; three 
different generators (grammar) of three, four, or 
five basic strings (see diagrams in Figures 2a, 2b, 
and 2c) were combined in all possible ways with 
three sizes of vocabulary, either of three, six, or 
nine words, Nonsense words were used in all the 
languages in order to minimize possible associa- 
tions with the stimulus sequences. A tenth language 
consisted of sequences of random strings which 
were generated with the help of tables of random 
numbers. A vocabulary of three words was used 
for this set. 

The languages were constructed with the follow- 
ing principles in mind: (a) Grammar IIT (five 
basic strings) is a hybrid composition from Gram- 
mars Гапа II. (b) The grammars are constructed 
to facilitate the experimenter in detecting objec- 
tively the strategy used by a subject. This prin- 
ciple places a constraint of congruence of certain 
parts—initial and/or final—in the structure of the 
basic strings. (c) The placement of symbols on 


transitional arrows is determined partly Бу the 


reason given previot iy, partly by haphazard as- 
signment, (d) The namber of instances (samples) 
in all ensembles in this study is 16. This particular 
design of three types of grammar and three sizes 
of vocabulary w used to include strategy con- 
struction under a variety of conditions to be tested 
more specifically later. The random strings con- 
dition was designed to serve as control for the 
nonrandom conditions. The nonrandom conditions 
are the nine grammar groups. 

Sixty subjects were run individually, six sub- 
jects per condition, Subjects were distributed one 
per condition in the order in which they came. 


eInalysis of Strategies 
Qualitative Лпауя5 


There were 16 instances of sentences from cach 
of the languages; the cards were numbered on 
the backs from 1 to 16. Each subject’s sorting 
was recorded in terms of these numbers. The 
grouping produced by a subject was then com- 
pared with the various groupings that would 
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FIGURE 2c --GRAMMAR TIL 


Fic. 2. Finite state generators for sentences 
used in the Exploratory Study (Experiment 11). 
(The small letters on arrows indicate similarities 
and differences in the initial, final, and recursive” 
loop positions of symbols in the sentences 28 
erated). 
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result from following the different strategies. 
Thus, if the subject's groups were (3, 7, 15, 16) 
(1, 5, 10) (11, 12) (2, 6, 13, 14) (4) (8) (9), 
they corresponded with groups that resulted if the 
Length strategy were followed. Length strategy 
refers to grouping the instances by the number of 
words—the length of the sentence—contained on 
each card. The experimenter had worked out 
various groupings in terms of possible strategies 
so that a strategy could be objectively identified. 
ОЕ course, the matching samples generated by 
a subject for each of his groups and the protocol 
were additional checks upon the identification of 
the subject's strategy. 

Out of the billions of possible partitions of the 
16 instances only about 20 strategies, not always 
markedly different from cach other, were found 
to be used by subjects. The five most common 
strategies were labeled as follows : 

Length: Subject puts together instances having 
the same number of words. 

Alphabet: Subject alphabetizes the whole set 
of instances, treating it as a single category. 

Initial : Subject sorts b milarity found in the be- 
ginnings of sentences. This similarity could range 
anywhere from one to four words. 

Initial-Final: Subject sorts by the identicalness 
both of the beginnings and the endings of sen- 
tences. 

Formula: 
experimenter's rules for the finite state 

Miscellaneous: This category is a 
group for isolated cases of sorting such as sort- 
ing by a single striking word, or by the repetitive 
elements, or by rhythm, etc. 


This strategy approximates to the 
generator. 
cover-all 


Quantitative Techniques 


Since a variety of strategies was employed, 
strategies ranging from partially successful ones 
to the one that does succeed completely, it hecomes 
necessary to order or scale the strategies. Earlier, 
we defined a successful strategy as one that 
completely corresponds with the basic strings of 
the experimenter’s grammar. Three measures for 
Scoring strategies were employed: an agreement- 
disagreement measure, а rating scale, and a time 
score. The first two were constructed specifically 
for evaluating strategies in this research. 

Agreement-Disagrecment Кано (A-D ratio). 
This statistic is а kind of correlation. technique 
for measuring the degree of agreement and dis- 
agreement between the experimenter’s grammar 
and а subject's categories; OT between any two 
Subjects’ categories. The ratio expressed (roughly) 
by the observed number of agreements over the 
total number of possible agreements т grouping, 
Yields a measure of correspondence. When the 


two groupings are identical, the A-D ratio ies 
Unity (see Appendix A for details of this tech- 


nique), 
Rating Scale (R scale). Each subjects — 
Was assessed on а 10-point scale developed for 


this purpose. This scale was set up in the follow- 
ing way. Each subject’s performance was classi- 
fied into a strategy by the methods previously 
stated. Approximately 20 strategies seemed avail- 
able to the subjects. Identical randomized lists 
of these were presented to five judges for rating. 
The judges were graduate students at Radcliffe 
College and Harvard University. They represented 
different fields of experience and cultural back- 
grounds. The judges were instructed jointly to 
rank on a 10-point scale the strategies subjects 
used. The instructions given to the judges were: 


You have been handed a list of labels for the 
strategies subjects used in an experiment in 
sorting into groups a pack of cards prepared 
by the experimenter. [Judges were shown the 
cards and the experiment was explained to them. 
Also, the labels of the strategies were described 
or explained.] You are being asked to rate 
these strategies by assigning them values be- 
tween 1 and 10. Rate the poorest strategy 1 
and the best one 10. You may assign the same 
value to several strategies if you think they rate 
the same and you may indicate distance in the 
quality of strategies by the size of the interval 
you leave between any two. Any questions? 


After the questions were answered the judges 
worked independently. From the averages of the 
scores assigned by the judges to the strategies а 
standard 10-point rating scale for evaluating 
strategies was developed (see Appendix B for 
details). 

Time Measure. A record of the total time re- 
quired for sorting by each subject was made. 
Thus the number of seconds per word provided 
a third measure of a subject's performance, 


Reliability of Measures 


The A-D Ratio. Although this statistic is a 
fair measure of success in terms of the cor- 
respondence of a subject’s responses to the ex- 
perimenter's rules for generating the strings, it 
is subject to experimental artifact. For example, 
the Initial. strategy (instances grouped together 
on the basis of the first word only) yields a 
spuriously high correlation. score. The number 
of subjects using the Initial, strategy is com- 
paratively small, however, and the other two 
measures offer a second check on the results. 

A second difficulty with this statistic is that 
interlanguage comparisons cannot be made without 
using some kind of correction factor for the num- 
ber of instances in the set. The measure of agree- 
ment rises with the increase in the number of 
instances; the measure is not really comparable 
from one language system to another. 

The R Scale. The following table shows the 
reliability of the ratings of strategies made by five 
judges. { correlations were used. 


Rank-difference 
Tt should be noted that the equality of intervals 
between ranks in this scale cannot be established. 
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TABLE 1 


RANK-ORDER CORRELATIONS BETWEEN JUDGES FOR 
RariNG STRATEGIES 


Judges 1 2 | 8 | 4 
2 .89 
3 .91 77 
4 .88 .89 .82 
5 291 .87 .89 .88 


—From the ratings assigned by these judges to strat- 
egies was developed the 10-point rating scale. 


Time Score. Although time measures have been 
commonly used in studies of visual pattern per- 
ception (Adams, Fitts, Rappaport, & Weinstein, 
1954; Anderson & Leonard, 1958; Fitts et al, 
1936), in the present research they are only a 
rough measure. Several factors are simultaneously 
operative which decrease the reliability of the time 
scores. As the number of instances in the set 
increases, the time for sorting increases dispropor- 
tionately, paralleling the rule that time increases 
according to the amount of information that must 
be processed. Both length and complexity of task 
tend to increase time, but if a subject works with 
the self-instruction of speed, he finishes the task 
much more quickly. The difficulty here is circular. 
The less time a subject spends with the problem 
the less information he extracts from the input 
and the less time he spends dealing with his 
problem. 


Results 


The distribution of subjects in the Ran- 
dom group is compared in Figure За with 
two of the nine groups (L3 and 111,9) 
with respect to the kinds of strategies used 
in each. The bars for the Random group 
and the (111,9) group (five basic strings 
and a vocabulary of nine words) are longer 
at the opposite ends. Condition (1,3) 
(three basic. strings and а vocabulary of 
three words) has more affinity with the 
Random group than is apparent from the 
figure; three of its subjects who were 
classified under the Miscellaneous condi- 
tion in the type of strategy used actually 
used some kind of differentiated counting 
technique that was slightly more complex 
than the plain counting involved in the 
Length strategy. Figure 3b compares the 

same three groups by the average number 
of different categories made by cach in the 


Н.6 FOR EACH GROUP 


a E (1,31 - бат 
CZ) 1,91- GROUP 
ШШ Pario- crou 


За. Frequency comparison of strategies by sub- 
in the Random and two grammar groups. 


j 3 H . 
(The Random group worked with 16 strings that 
had been haphazardly generated. The (1,3) group 


worked with 16 well-formed strings that had 
been generated with three formulas and a vocabu- 
lary of three words. The (111,9) group worked 
with 16 well-formed strings that had been gener- 
ated with five formulas and a vocabulary of nine 
different words.) 


тб PER GRAMMAR CONDITION 
G+ NUMBER OF INSTANCES IN 
b EACH CONDITION 
FILLED BARS REPRESENT NO, OF 
DIFFERENT CATEGORIES. 
SPACED BARS REPRESENT THE 
WEAN NO OF CATEGORIES, 


RANDO 


(ш эу 
m 


© STRINGS 


3b. Mean number of categories for subjects in 
the Random and two grammar groups. (Solid 
columns—first in cach pair—compare the same 
three languages for the number of different cate- 
gories made. Different categories refer to instance 
categories, e.g., one-instance category, two-instance 
category. Broken bars compare the same three 
groups for the average number of categories for 
each group.) 


Fig. 3. Frequency comparison of strategies and 
mean number of categories. 


first columns (filled bars), 
categories we mean instance 
example, 


(By different 
categories; for 
category, two-in- 
stance category, and so on.) Notice how 
the Random condition, 


one-instance 


where every in- 
stance belongs to а scparate category, aver- 
ages to the largest number of different 
categories. The second part (spaced bars) 


SOME VARIABLES AFFECTING PATTERN CONCEPTION 17 


compares the groups for the mean number 
of categories made by each. Here the 
difference between the Random group and 
(1,3) group is negligible. 

Table 2 is a frequency tabulation of all 
subjects showing their distribution by the 
kinds of strategy they employed and the 
experimental conditions under which they 
had worked. Note that there are just 
about five or six types of strategy that 
cover all subjects under the various condi- 
tions tested. Also note that the better ог 
the more successful strategies are used 
under some conditions more than in some 
others. For example, none of the 18 sub- 
jects used the Length or the Alphabet 
strategies when the number of alternative 
words used was nine. Ten of these subjects 
used some form of the Formula strategy. 
When the sentences were formed with a 
vocabulary of only three words, on the 


other hand, 5 of the 18 subjects used the 
simpler strategies and only 4 discovered 
the Formula strategy; all of those 4 sub- 
jects came from the same condition group. 

For each of the 10 experimental condi- 
tions, Table 3 presents means which have 
been calculated by assigning values to 
strategies by the statistical techniques dis- 
cussed above. Because the experimenter 
worked with no rules in the Random set no 
A-D ratios can be calculated for that 
strategy. Ratios for Condition ПТ are found 
10 be higher for each constant condition 
of vocabulary size, except in the grammar 
Group (11,3), which may be treated аз an 
exception in all the tables given subse- 
quently because of an artifact in the con- 
struction of the language. The group means 
for the R scores are tabulated in the second 
row. Here the Random group and (1,3) 
group are more alike, as we have com- 


TABLE 2 


FREQUENCY DISTRIBUTION 


or SUBJECTS IN THE RESPECTIVE LANGUAGE GROUPS BY THE 
TYPE or STRATEGY THEY USED 


Kind of Strategy | 


Grammar N 
Length | Alphabet Initial | Init.-Final) Formula Mise. | 
| | 
Random; 3 2 1 1 А _ 1 | 6 
(1,3) 1 2 | ай 3 | 6 
(11,3) 1 1 | " | 
(111,3) AN 1 1 | 3 ) | 
Total (3) Q | (D e | W (3) i 
Мн 95 whee: TE ايسا ا‎ 
(1,6) BEN 2 | 1 1 | : | 1 
(11,6) 2 : | и | 2 i 
(111,6) سا‎ 2 2 1 
pa | Ф = NN NM. Qo в 
—— — —— — —— |7 — — — ا‎ 
an Pot etal | 
(11,9) | | | | | i 
(111,9) E S See Nr 
"Ep o mE = "m @ | a9 | 18 
Total = | E | == С =: 
5 | Е 13 18 6 
Sum Total 10 2 11 | 
E. | mediately preceding those totals. 


re sums of frequencies im 


Nore- Е: : 
iiio: arentheses а! 
SES E to the (11,3) group. 


® These four subjects all belong 
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TABLE 3 
Group MEANS FOR TEN EXPERIMENTAL CONDITIONS 


Language Group 0 
Mean Score | 
| 03 | 18 | 13 111,3 1,6 | 11,6 111,6 | 1,9 11,9 111,9 
Ж | | 
| | | | | 
A-D ratios == 6 | 773 | .616 .437 .601 | .73 | „728 | 805 85 
| а s | _ Ж Д И 
1i 55 53 
R score | 1.83 | 2.83 | 7.0 | 3.71 | 3.66 | 4.66 | 3.33 | 6.83 5.9 1.2 
| | | | Mah Foil ама ы 
| | | | | m 
Time score | 6.04 ү | #08 | 6.36 | 3.61 |4.75 4.64 | 4.00 | a | 6.05 
| | | | 


Note.— The means have been calculated by three independent statistical measures: A-D ratios, Rating scores, and Time scores. 


Time scores represent average time taken per syllable. 


mented before, and both are inferior to 
Groups (19), (1,9), and (IIL9). The 
time comparisons in the third row are some- 
what fuzzy. Sometimes a subject, who may 
have worked with the instances for a very 
long time, will settle on a very poor 
strategy at the end ; sometimes, the opposite 
is true. 

А comparison of the group means by 
vocabulary and by grammar by applying all 
three measures is made in Table 4. 
Although the figures given in this table are 
not always directly comparable, their rela- 
tive increase or decrease with change in 
the experimental condition is constant. 


Discussion 


This first study was intended to explore 
the kinds of responses subjects make to a 
categorizing task. The task as we have de- 
fined it is one of sorting strings of words 
simultaneously presented to a subject on 
small cards. The subject is free to work on 
these instances for as long as he wishes, 
without receiving further help from the 
experimenter, however, or from the in- 
structions that are given to him. He is 
neither told how many categories he 
should make, nor is he given any indica- 
tion of success or failure. The criterion 
of success, if any, is set by the subject him- 
self and he does not have to form groups 
if he thinks no groups can be made, 

Though the experimenter is somewhat 
in the dark as to what the nature of the 


controlling variables is, he does, however, 
anticipate a few problems. Therefore, he 
sets up his first study under a variety of 
experimental conditions to get leads about 
the possible functional relations between 
some of the experimental variables and 
subjects’ responses. The basic variables 
that the experimenter works with are: the 
number of formulas which provide the 


TABLE 4 


А Comparison or RELATIVE DIFFICULTY OF 
GRAMMAR AND OF VOCABULARY AS INDICATED 
BY THREE INDEPENDENT MEASURES 


Grammar: Random| 1 п TII 
A-D ratios — «53 512 43 
R score 1.83 | 4.44 | 5.7 5,1 
Time/Word | 6.04 | 4.85 | 5.91 | 5.59 

Vocabulary: Random 3 6 9 


A-D ratios — 


R score 


4.6 |3.9 


Time/Word | 6.04 


5.48 | 4.51 
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basic framework for the strings and the 
number of different alternative words with 
which he develops his formulas. 

The results illustrate several important 
features about subjects’ responses in an in- 
formation processing situation. First, sub- 
jects process information by rules, or as we 
have suggested earlier, by pattern concep- 
tion. Next, we note that the rules formu- 
lated by more than 90% of subjects can be 
classified under five headings: Length, 
Alphabet, Initial, Initial-Final, and Гог- 
mula strategies. Third, there is a hierarchy 
within these five or six types of strategies 
which has been identified in subjects' re- 
ponses. These strategies vary from simple 
to complex. Finally, the results indicate 
that some experimental conditions are bet- 
ter than others for inducing the formula- 
tion of the more successful strategies. The 
reader will recall that the more successful 
strategies have been defined as those that 
have a higher agreement with the experi- 
menter's formulas. The Random strings, 
which were haphazardly constructed, em- 
body a condition which is least favorable 
to the emergence or acceptance of pattern 
strategies in subjects’ behavior. Evidence 
irom our data suggested that vocabulary 
and grammar may be two of the more 
effective parameters in pattern conception. 


Pattern Conception. What we had ob- 


served as a dramatic instance of orderly 
responses demonstrated in the introductory 
study, was confirmed by the responses of 
more than 50 subjects in this exploratory 
experiment. It is not а fortuitous matter 
that no subject, not even one from the 
Random group, handed the cards back to 
the experimenter in some haphazard ar- 
rangement ог declared the task insoluble. 
On the contrary, in each case the subject 
used some kind of a system by which he 
had looked for relations among the in- 
Stances to create his categories. Even with 
the Random group some subjects worked 
very hard and long trying to find some sort 
Of scheme to organize their с 

Ву and large, the subject could verbalize 
the type of relations that he had discovered 


information. 


in terms of a rule or rules though he was 
not always able to do this with precision. 


Hierarchy in Rules. The thing that 
amazed us most was that nearly all subjects 
could be classified so easily within the same 
general types of five or six strategies— 
strategies which were, by and large, the 
same ones we noted earlier in the responses 
of our single subject described above in the 
Illustrative Study. In other words, a 
hierarchy in rules that was indicated in the 
individual’s responses seems to characterize 
the group responses as well. This is a 
two-sided observation: it points first, to 
the fact that responses under all conditions 
of card sorting described above are classi- 
fiable within five or six easily detectable 
strategies; and second, that these strategies 
themselves can be ordered in complexity. 

We can classify these strategies into two 
types: the ready-made type and the in- 
vented type. Ready-made rules are inde- 
pendent of the nature of the information 
to be processed. The rule of counting, for 
instance, to make groups of equal length, 
already exists in a subject’s repertory and 
can be easily transferred to almost any kind 
of material. The ready-made rule applies 
whether the materials be heads of cattle, 
pictures in an art gallery, or the coded 
messages about the enemy’s moves. This 
rule of counting is an instance of a simple 
quantitative strategy that is highly useful 
in dealing with certain kinds of informa- 
tion, but not so appropriate for certain 
other kinds. The counting rule may take 
on slightly different shades, as counting the 
number of different words, or counting the 
number of repetitions of any word, etc., but 
essentially it is the same rule that is applied. 

Alphabetizing is here classified as a 
ready-made strategy because this rule, too, 
already exists in the everyday repertoire of 
a college student. Since it is always possible 
to alphabetize a given series of words of 
апу language, it requires little ingenuity on 
the part of a subject to apply this rule. 

Tt is important to note that a subject 
loses information by applying the simple 
strategies of counting and alphabetizing to 
these not-so-simple sequences. We have, 
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therefore, evaluated such strategies as being 
less successful. А strategy is rated as less 
successful in proportion to the amount it 
deviates from the finite state grammar 
which governs the given strings. In the 
simpler strategies a subject seems to work 
more with the perceptual characteristics of 
the information than with the structural 
ones. 

Subjects in the Random group, which is 
the control group for all the nine experi- 
mental groups, seem to resort to the simple 
strategies, particularly the one of counting, 
more than the subjects in the grammar 
groups seem to do, except the subjects of 
the (1,3) group. The responses of this 
group have close kinship with the responses 
of the Random group. When a subject in 
any group either finds the task too difficult 
for him or when he does not wish to exert 
himself, functionally, he is treating the 
grammar strings as if they were random 
strings. Hence the strategies that subjects 
would be found to be using in these in- 
stances would be close kins to counting or to 
alphabetizing strategies. It seems highly 
probable that subjects in the (1,3) group 
were treating the grammar strings as ran- 
dom strings. 

The observed fact that a subject in the 
Random group follows a counting strategy 
does not necessarily imply that the subject 
has worked with one strategy only. Sub- 
jects’ protocols bear evidence to the con- 
trary. Several subjects described complex 
strategies that they had tried to apply to the 
strings, but which they had discarded be- 
cause they always seemed to find too large 
a number of single cases. Notice that a 
subject is uneasy about having too many 
one-case categories, and this is virtually 
what he would find in the random set 
where each instance is unique. After having 
searched for abstract relations, a subject is 
forced by the stimulus properties of the 
set to settle for something less. This sample 
of behavior is apropos to the basic argu- 
ment in this thesis. 

Strategies labeled Invented are different 
from the ones just described. We classify 
the Initial,-Final, strategy and the For- 
mula strategy under this label because it is 


the nature of the materials which suggests 
these strategies to the observer. A high 
degree of discovery, however, is also in- 
volved in these strategies, but since a sub- 
ject is unaware of the embeddedness of any 
formulas within the materials, we prefer 
to call them the Invented rules rather than 
the Discovered rules. This label, however, 
applies most appropriately to the Formula 
strategy. In this strategy the subject has 
to go beyond the obvious information. The 
subject has to tease out an invariant 
sequence of words in each of the instances, 
as also he must find the recursive units. 
The search for invariance and variance 
yields formulas which are the basic frame- 
work for the grammar strings. This type of 
response calls for a certain degree of ab- 
straction in order to discover the true pat- 
tern implicit in the strings. 

Invented strategies correlate more closely 
with the information given. Since the in- 
structions a subject receives are neutral, his 
only source of information is the set of 
strings he has before him. The amount of 
information he is able to extract from them 
is a critical factor in the way he will process 
the information. His efficiency in ex- 
tracing the maximum amount of useful 
information depends largely upon two, 
sometimes interdependent, conditions of 
motivation and stimulus complexity. 


Motivation. Each subject seems to re- 
define the problem to himself and it is very 
important how he does define the task. 
Subjects who look for quick and easy solu- 
tion—whose motivation appears to be to 
escape the situation as quickly as they can 
—do not get themselves involved in the real 
problem of finding relations among sets of 
instances. They tend to manipulate the 
more obvious characteristics of the stimuli 
and fail to extract useful information con- 
tained in the sequence order of the words. 
Though they might also see other little bits 
of isolated information, they tend to ignore 
these and proceed to apply the easy rules 
of counting and alphabetizing, thus mini- 
mizing the cognitive strain. 


Stimulus Complexity, Subjects who 4° 
not get seriously involved in the task m2? 


a. 
~~ е 
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or may not end with the Invented strate- 
gies. One subject in the Random group 
worked with her 16 cards for 41 minutes. 
She tried to make groups by initial similari- 
ties, by ratio relations, by sequence rela- 
tions, and by a method of substitution. But 
when none of these strategies worked by 
her criterion of success, she resorted, ac- 
cording to her own version, to “the simplest 
strategy of counting." The task of seeing 
relations among the random strings is per- 
haps psychologically more complex than the 
task of seeing relations where relations do 
in fact exist. 

The results from all nine 
group conditions of finite state systems 
where relations do in fact exist, do not 
distribute under the Tnvented strategies 
only. Some do and some do not. In Table 
2 we notice that the stimulus condition, 
where nine alternative words were used to 
develop the strings of sequences, as conr 
pared to three in the Random group. and 
three and six in the other two conditions, 
more than 50% of the subjects evolved the 
Formula strategy as compared to the 0% 
of the Random group and 2266 in each of 
the other two. Of these, the 20% of sub- 
in the vocabulary three group. all 
come from a single condition, (11,3), 
meaning that none of the subjects from 
Condition (1.3) or (111,3) or (0.3) ar- 
rived at the Formula strategy. 

Are these differences. indicated in the 
percentages of subjects who evolve the 
Invented strategies and those who do not, 
functionallv related to the complexity of 
the experimental variables under which 
subjects work? If 50, what constitutes ех- 
perimental complexity? That is the ques- 
tion we turn to in the next section. 


grammar 


jects 


EXPERIMENTAL VARIABLES 


In the exploratory study of the previous 
section evidence for a search for pattern 
was found. Sometimes this search was 
Creative, sometimes transitive, but what was 
not clear from the data was why this differ- 
ence existed in the quality of performance. 
Informal evidence suggested that instruc- 
tion was one variable that needed to be con- 


irolled more precisely and the data indi- 
cated that some stimulus features of the 
input were contributing to the outcome. 
Accordingly, some experiments were carried 
out to assess the effect of the following 
four variables: 

1. Instructions. Four different instruc- 
tions were tested to select one that would 
maximally involve the subject in the task, 
but with minimal guidance from the nature 
of the instructions. 

2. Number of instances. The number of 
instances per formula was varied from two 
instances to three to four, to test the effect 
of the amount of evidence on the quality of 
strategy. 

3. Number of different alternatives 
(Vocabulary). Three or nine different 
words comprised the vocabulary for de- 
veloping the different language systems. 


4. Structural similarity (Grammar). The 
effect of the formal structure of the lan- 
guages, three rules as against five rules 
with variation in similarity between the 
rules, was also investigated. 


The Instructions 
( Experiment TID) 


Materials 


Two new artificial languages, (1,9,16)» and 
(111,3,23) . were constructed to study the effects 
of four types of instruction on each. Language 
(19,16). is constructed with three formulas in 
its grammar; its vocabulary consists of nine words. 
Sixteen instances were selected to make up this 
set. Language (111,3,23) ‹ contains five formulas; 
three words have been used for its vocabulary. 
This set contains 23 instances of well-formed 


strings. Two additional language conditions, 
(1,3,16) апа (1,3,23), were also tested, both 
under "eliminate speed" instructions. Altogether, 


then, there were 10 language conditions that were 


compared in this study. 


Procedure 

The same general procedure described earlicr 
was used in all experiments reported in this sec- 
tion. The specific variable to be considered here 
kind of instruction to be used. Ten groups 


is the es 
of subjects, four subjects per condition, were run 
under four different instructions. Мо subject 


served twice in any part of the experiment. Sub- 
jects were distributed, one per condition, in the 
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order in which they came. The following four 
sets of instructions were used : 

1. Neutral "Here is an array of cards to be 
sorted into groups, into as many or as few groups 
as you wish to make, and for whatever reason 
you wish to make them. When you are satisfied 
with your groups inform the experimenter.” 

2. Eliminate Speed. “Here 15 an array of cards 
to be sorted into groups, into as many or as few 
groups as you wish to make, and for whatever 
reason you wish to make them. Your performance 
wil be judged mot by speed, that is, not how 
quickly you are through, but by what you do. 
When you are satisfed with your groups inform 
the experimenter.” 

3. Relations. "Here is an array of cards to be 
sorted into groups, into as many or as few groups 
аз you wish to make on the basis of some relations 
that you can find in the members of the groups. 
Your performance will be judged mot Бу speed, 
that is, not by how quickly you are through, but 
by how well you discover relationships. When you 
are satisfied with your groups inform the experi- 
menter." 

4. Logical. "Here is an array of cards to be 
sorted into groups, into as many or as few groups 
аз you wish to make on some logical basis. Your 
performance will be judged not by speed, that is, 
not by how quickly you are through, but by how 
logical you are. When you are satisfied with your 
groups inform the experimenter." 


Results 


Table 5 presents the means, ranges, and 
critical values for the A-D ratios, R scores, 


and Time scores for 40 different subjects 
run under 10 different experimental condi- 
tions. The critical differences between 
groups were calculated by the Link-Wal- 
lace method of short-cut allowance for one- 
way analysis of variance (Mosteller & 
Bush, 1954. р. 305). This statistic uses the 
range as a measure of variation instead of 
the sum of squares. 

Conditions Г, G, H, and I in Table 5 
apparently belong together. The difference 
between any two totals in this group is 
below 1.51 in the A-D ratio comparisons. 
below 18.80 in the К score comparisons, 
and below 41.59 in the time comparisons. 
It is clear, then, that variation in instruc- 
tions fails to produce any recognizable 
difference in the performance of subjects 
in language (111,3,23),. Presumably, the 
task involved in this language is too hard 
for the subjects. Even when the subjects 
(Column H) took longer than subjects in 
any other group, they failed to give efficient 
results. Since the complexity of this lan- 
guage masks the effect of instruction (if 
any), we shall concentrate on the simpler 
Grammar I. 


Conditions C, D, E, and J likewise 
obviously belong together. None of the 
differences between any two of these con- 


TABLE 5 
Means, RANGES, AND CRITICAL VALUES 
A-D ratios R sc i с 
ЕС score Time score 

M в | cv | M È | | м R CV 
А (1,3, 16)  Elim. Speed | .333 | .40 1.75 | 3 2.38 | 1.39 
B (L9,16) Neutral -402 | .433| 1.95 | 4.25] 7 0.47 | 3.55 
C Elim. Speed | .69 | |25 7.5 | 6 aa oe | 
D Relations 95 | .20 9.75] 1 5.49 | 4.35 
Е Logical 76 | 14 15 | 6 5.35 | 271 
Е (III, 3,23), Neutral 58 | .21 | 1.51 | 4.0 2 |18 
G Elim. Speed | .50 | 43 5.25 | 5 os i vt ee 
H Relations 58 | .27 5.0 | 4 18.58 | 22.59 
1 Logical 49 | 43 5.75 | 5 8.55 | 17.70 
J (1,3,23) Elim. Speed | .79 -46 8.75 1 9.76 | 10.09 | э 

| } | 
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ditions exceeds the calculated critical values 
in any of the three measures. АП of these 
four conditions belong under the simpler 
Grammar I, which also includes Condi- 
tions А and B. Subjects who served under 
Condition A performed most poorly, even 
though, as in C, D, E, and J, they were 
speed. Subjects for 
(19,16), who served under Neutral in- 
structions (see Column B) do not resemble 
or differ consistently with the other groups. 


Discussion 

In the preceding section we raised a 
question about why some subjects follow 
easy strategies and some others follow more 
complex ones. Ап analysis of the subjects' 
protocols gave an indication that one neces- 
sary, though not sufficient, condition for 
evolving a good strategy is that the subject 
should extract all the useful information 
from the instances he receives. This busi- 
ness of extracting the most useful informa- 
tion is in itself a very critical step in pat- 
tern conception. The success with which 
а subject decodes relevant information 
from the stimuli depends partially upon 
the degree to which he involves himself in 
the task, This in turn is influenced by a 
variety of factors, such as the past history 
of the subject, his present motivation, his 
he instruction he re- 
enter regarding the 
other things being 


age, etc., as well as t 
ceives from the experim 
nature of the problem, | 
equal. An individual's history or his moti- 
vation is difficult to control by laboratory 
techniques, but the experimenter can ex- 
plore the instruction variable. | | 

Actually, а subject receives two kinds of 
information from the experimenter. One 
kind comprises the main body of informa- 
tion that is contained in the strings of non- 
sense words with which the subject is sup- 
posed to work; the other is some informa- 
tion about this main body of information 
to be categorized. The main body ot ш 
mation is contained т what Bartlett ( 1958) 
has called a “closed system," which the 
Subject must discover. The efficiency sem 
Which a subject processes the main boc y 
of information will in part depend on how 


well he has decoded the information from 
the instances presented to him. It is the 
experimenter's wish to get a subject started 
on this search but without telling him how 
to proceed. Thus in setting up the instruc- 
tions the experimenter keeps two things in 
mind: to get a subject maximally involved 
in solving the problem and to do this with 
a minimum of guidance from outside. 

Neutral instructions are suspect when 
they are evaluated by this two-edged 
criterion. In the absence of specificity, both 
of instruction and of materials (nonsense 
strings), a subject may often resort to self- 
instruction, either to work for speed, or to 
find some easy-way-out strategy; he there- 
by loses much information. To overcome 
this difficulty it seems necessary to furnish 
a subject with some other kind of instruc- 
tion. 

French (1954a) has shown that perform- 
ance improves with more specific instruc- 
tions in a task that involves sequential de- 
pendencies. The question here, however, 
is not one of improving a subject's per- 
formance by giving him more guidance or 
by making the instructions more specific 
we take that for granted—but rather the 
question is one of finding the kind of in- 
structions that will induce a subject to 
make the maximum effort to solve the prob- 
lem, and to bring this about with as un- 
structured a kind of working instruction as 
possible. 

We noted in the results that when a 
language is too difficult for the subjects, as 
(111,323). seemed to be, the effect of in- 
structions is confounded. But with the 
simpler Grammar I, subjects do better 
when they are instructed to slow down, 
with the one notable exception of (1,3,16) 
— see Condition A. We pointed out earlier 
that the results of Condition A are the 
poorest of all the conditions tested. A 
differs from В, С, D, and E in the size of 
the vocabulary ; it differs from F, G, TET 
and T in its grammatical structure; and its 
difference from Condition J seems to be 
related to the number of instances. 


clearly t 
A suffers in part from one or 


Presumably, 
more of these conditions. 
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The difference between B and Conditions 
C, D, E, and J was attributed to different 
instructions. All four conditions, C, D, E, 
and J, required subjects not to rush (note 
that Relations and Logical also eliminate 
speed), and they all emphasized that it is 
what a subject does that is more impor- 
tant than speed. Condition B is neutral to 
these emphases. It does not seem unreason- 
able, therefore, to suppose that the em- 
phasis on “what you do" increases the search 
for useful information, and this search ior 
the useful information, we have granted, is 
one critical step in pattern conception. 

Johnson (1955) discusses the effect of 
instruction on performance under the con- 
cept of “set” (see pp. 64-67). After de- 
fining set as "temporary integration of be- 
havior," he elaborates on its meaning as in- 
tegration in terms of the obvious and simple 
things, or integration in terms of "subtle 
aspects of complex stimulus patterns." Ex- 
plained in this way, set seems to be no 
different from recoding of information. 
Verbal instructions can drastically change 
the quality of recoding. As for instance, in 

the Carmichael, Hogan, and Walter (1932) 
experiment the same figures are recoded as 
dumbbells by one group and as spectacles 
by another group of subjects, depending 
upon the instructions the group received. 
In studies such as this one, the experi- 
menter has already laid down the recoding 
Scheme for his subjects through the in- 
structions he provides, and then he is sur- 
prised (and pleased perhaps!) to find that 
subjects have followed his scheme. 

Tt is this kind of coaching in the re- 
coding scheme of a subject that we wished 
to avoid in our instruction. For, after all, 
a major interest in the present research is 
to study those factors that affect the re- 
coding scheme of a subject. Very little 
new information will be gained by recover- 
ing the same coins that we originally fed 
into our slot machine. 


So, what do we finally conclude? On 
the whole one could say that the instruc- 
tions, to the extent to which they were 
varied, did not produce a strong effect on 
the quality of the performance. Other 


variables (complexity of grammar, num- 
ber of different alternatives, number of in- 
stances) seem to be more important be- 
cause they can make the problem so hard 
that they mask out the effects of instruc- 
tions. When conditions are optimal for 
instructions to have an effect, the main 
effect that changed instructions produced 
was to slow subjects down to make them 
stop, look, and think more carefully, and 
this is the only effect the experimenter 
wished to induce through instructions. Any 
more help than just maximally motivating 
subjects to think would be a violation of 
what has been described as good instruc- 
tion. Since the instruction to eliminate 
speed, together with the emphasis on “what 
you do," is common to all three sets of in- 
structions tested, it seems reasonable to 
suppose that this amount of change in the 
neutral instructions was enough to involve 
the subject in the task. Therefore, we 
adopted Eliminate Speed instructions as 
standard for all subsequent studies. 


The Number of Instances 
( Experiment IV) 


Materials and Procedure 


One important characteristic of an ensemble of 
instances is its size. The size of the set can be 
controlled by controlling the number of formulas 
of a grammar and/or the number of instances 
given for each formula. Moreover, one might 
like to control the total number of words in the 
ensemble. Тре structure of the languages used 
in this study is clarified in Table б. 

The Random condition contained по formal 
grammar; it served as the control group for all 
the other conditions. Languages B, C, and D are 
identical except for an orderly increase from two 
instances per loop in (13,13) to three instances 
in ‚1,318) to four in (1,3,23). The number 
of instances per loop and the size of the Set are 
interrelated; as instances increase in number, the 
size of the set increases also, | 

Languages (1,3,13) and (1,9,13) are 
cal except for the difference in the size of 
the vocabulary: B is developed with three words; 
Е with nine. Thirteen instances for Grammar 1 
have been considered to comprise a minimum set 
hy definition, Condition F, (1,9,16)», represents 
a random increase in the number of instances 
Irom 13 to 16. The choice of the size, 16, was 
made fo match the size of the Random set, Con- 
ditions E and F will be 


also identi- 


compared for a small 


C ———— 


НЗ 
EE 
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TABLE 6 

А STRUCTURAL ANALYSIS OF LANGUAGES 

| 

Language ет | А НЯ | Instances Size | ЕС 
formulas different words | рег юор of set | ECC " 

| | 
А M 0 | 3 | 1 16 | 118 
в TAO | гии 3 | 2 в | 99 
в ее | 3 Е | 18 | 180 
D 0823 NN o 3 | " 23 | 180 
E (19,3) 5 E | __ 9 ; 2 Г m | 90 
Е (19,16) NE; js 9 Г deme | i | 18 
G (11,323); 008 | 3 | 2 23 | 180 
и (19,23). | s | 9 2 23 | 180 


a Same as in В. 
ъ Same as in С. 
increase in the number of instances from 13 to 


16, other things being held constant. 
A similar comparison in size is envisaged for 


Languages (111,3,23). and (111,9,23) Gram- 
mar III consists of five formulas as ist three 
of Grammar Г. A minimum set for Grammar Ш 


instances, which also allows 2 


consists of 23 
instances per loop. All experimental conditions for 
these two sets were held constant, except for the 


size of their vocabulary. 
One could also compare (1,3,13) and (HT, 


323). with each other as minimum sets for 
their respective grammars. Since the number of 
instances per loop has been held constant, any 
differences found between the two languages would 
be attributable either to the number of formulas 
in the grammar or to the number of instances in 


the sets, or to both. 
Ten subjects per condition, using the instruc- 


tions aimed to eliminate speed, were run for each 
of the eight language conditions. 


Results 

Table 7 presents the means and critical 
values for 10 language conditions by the 
usual three measures. Only 8 of these 10 
groups, А through H, concern us in this 
section. We note first of all that in Gram- 
mar I. Conditions В. С, and D, as the in- 
r loop increase from two to three 
to four, strategies improve correspondingly. 
In working with 13 instances а subject 
spends about the same amount of average 


stances pe 


time per word as a subject working with 
the larger sets of 18 or 23 instances, but 
he does not achieve better results. In fact, 
the strategies used under this condition are 
poorer than most and about as poor as the 
ones used by the Random group, which 
used the poorest strategies. 


TABLE 7 


AND CRITICAL. VALUES 


Means, К 


A-D Time 
Language ratio | R score score 
Group EET 
M M|R| M R 
ры ý | 
А Random 
(0, 3, 16) — | 2.8 | 6 11.38 49.24 
В (1, 3, 13) 253 | 3.1 | 7 | 8.14 129.93 
© (3.19) 1389 | 4.6 | 7 | 6.71 23.02 
D (1, 3,23) 598 | 6.5 | 8 | 7.73 [14.27 
Е (1,9, 13) .532 | 6.7 | 6 | 4.996 8.29 
I (1,9, 16), 748 | 8.2 | 7 | 3.916] 4 23 
G (IH, 3, 23). 419 | 4.7 | 8 | 5.396 1.81 
H (IH, 9, 23). .706 | 6.0 | 7 | 5.035 6.53 
I (111,3, 2354 | .541 | 3.5 |8 4.998 11.05 
J (III, 9, 2334 |81 | 7.1 | 7 | 5.478/14.39 
33.37 79.08 


Critical Value 
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Next we note that without any increase 
whatsoever in the number of instances, if 
we merely changed the size of the vocabu- 
lary from three words to nine, performance 
is drastically modified (сї. Column В with 
E and G with H). Now look at the time 
measures. Subjects in Column F take far 
less time than subjects in Column В and 
yet they do far better than subjects in 
Column B. The same is again true, though 
not quite to the same degree, of subjects in 
(111,3,23). and (111,9,23).. Apparently, 
one must look for some other factor(s), in- 
stead of the number of instances, to ac- 
count for these differences between the 
groups. 

A further point of interest may be noted 
from a comparison of Languages (1,9,13) 
and (19,16), Earlier we explained that 
the two languages are identical in every 
respect except for a fortuitous increase in 
the number of instances from 13 in Condi- 
tion E to 16 instances in Г; the results, 
however, that follow this change are not so 
fortuitous. Even a small random increase 
in the number of instances in a language 
with the larger vocabulary apparently makes 
some difference in the subjective redundancy 
of the set for the subjects. But when we 
compare Columns A and F, both sets with 
16 instances in each, the Random set scores 
are much inferior. 

Now compare the minimum sets for both 
grammars. We consider 13 instances for 
Grammar I and 23 instances for Grammar 
ПТ, as minimum sets for each, by definition. 
(1,3,13) is not too different from (IIT,3, 
23). nor is (1,9,13) very different from 
(111,9,23).. Presumably, the effect of the 
size of the set is either negligible or it is 
masked by the effects of other variables. 

To summarize our observations, three 
points seem rather clear: the larger vocabu- 
lary decreases difficulty, more instances per 
formula also decrease difficulty, the effect 
of the total number of instances in a set is 
easily modulated by other variables. 

Tt is not clear, however, as to what effect 
an increase in the number of formulas pro- 
duces upon subjective difficulty. 


Discussion 


Two of the four points summarized 
above pertain to the number of instances in 
the set; they are not unrelated points. The 
total number of instances in a set is auto- 
matically increased with each new addi- 


tion of an instance for any of its formulas. 


This means that two things happen for 
a subject. First, every single addition of an 
instance to the set increases the difficulty 
of the set by increasing the number of rela- 
tions the subject must now study (if Me 
cards have m relations, then n+1 cards will 
have m+n relations). Second, every addi- 
tion of an instance for any of the formulas 
contained in the set simultaneously re- 
duces difficulty by increasing the amount 
of evidence available. This latter gener- 
alization, however, is not true for a random 
set of strings; an increase in the number 
of instances in the Random set does not 
reduce difficulty, but rather increases con- 
fusion. 

In the context of the present discussion 
we define redundant strings as those that 
have been derived from the same gram- 
matical formulas. They repeat information 
by replicating formulas or instances or 
both. Random strings, on the other hand, 
contain no formulas or any orderly repeti- 
tion of information. They could be viewed 
as containing as many formulas as there 
are instances in the set, as was suggested 
previously; as such, they will contain only 
one instance per formula. A redundant set 
which has been generated with one, two, or 
more formulas will contain instances which 
repeat information about the formulas. 
Thus the number of instances per formula 
and the number of formulas that constitute 
the grammar are somewhat interdependent 
features in a language. 


. Random strings are confusing to а sub- 
ject because they are difficult to package. 


They contain more information than any 


redundant language of comparable size 


contains (see Columns A and F) and this 
mass of disjointed information is difficult 
for a subject to handle. The larger the size 
of the Random set the more difficult it 18 
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Гог a subject to process the information it 
contains. Thus, the critical factor for in- 
creasing the efficiency of performance is 
not one of increasing the amount of hap- 
hazard information, but it is one of re- 
ceiving a greater amount of more useful 
information. Reiss (1958) came to the 
‘same conclusion with his subjects when he 
varied the amount of initial information 
presented. 

In the absence of repetition ( redun- 
dancy) of information, relations. between 
instances are few and far between. The 
subject, who is usually disposed to sorting 
his environment by searching for redun- 
dancy, is nonplused in a random set-up. 
Now introduce some redundancy into the 
situation by presenting to the subject two 
instances per formula, instead of one in- 
stance or none, and performance improves. 
Bartlett (1958) made this same observa- 
tion on his subjects in tasks that required 
gap-filling by methods of interpolation and 
extrapolation. In the case of a simple 
formula, a string with one loop only, if one 
now gives the subject just one more in- 
stance, making it three instances per loop. 
the improvement is phenomenal. The sub- 
ject’s response almost begins to approxi- 
mate to the grammatical structure of the 
language. One is led to think that de- 
pending upon the amount of complexity 
of the formulas, one could expect an upper 
ceiling in improvement in performance be- 
yond which an increase in the number о! 
instances ceases to be effective. 

When the number of instances per loop 
is small, say zero or one. it is more possible 
to confuse the instances of one formula 
With the instances of another on the basis 
of superficial relations—what Wertheimer 
(1945) calls “and-summative aggregations 
(p. 190). Actually in the ideal situation, 
where instances have been grouped accord- 
ing to the formulas they exemplify, m- 
Stances which are positive for one formula 
are negative for all other formulas in the 
Set. This, however. is true only when a 
Subject has clearly seen the structure ot 
the formulas in abstraction. Pattern con- 
Ception requires the observer to make a 


two-fold response to the stimulus ensemble ; 
a response of the ingroup type and of the 
outgroup type. The positive instances for 
a formula the subject must “aggregate” 
as members of the ingroup for that subset ; 
all other instances become negative in- 
stances for this formula and which he 
must “segregate” as the outgroup members 
belonging somewhere else. In effect, the 
subject is making an inductive-deductive 
response. Hypothesis making and hypoth- 
esis testing are going on simultaneously 
as a subject decides which instances are 
positive and which are negative for the 
particular category he is working with. His 
certainty increases as the number of in- 
stances that fit his hypothesis increases. 
More instances per formula increase evi- 
dence and consequently decrease the proba- 
bility of error. Tt is а well-known fact in 
communication engineering that one re- 
peats a message as a safety device against 
"noise" in the channel. The noise which 
the negative instances create for a subject 
in our channel can be reduced by increasing 
the number of positive instances, other 
things being equal. 

Thus we are led to conclude that within 
certain limits, the number of instances per 
formula reduces difficulty, subject to cer- 
tain interaction effects from other variables. 
Although an increase in the size of the en- 
semble, within a reasonable range, increases 
the difficulty of the set, the quantitative 
aspect of the set is relatively unimportant. 
Tf, however, the size of the set is too large 
the problem is either rendered insoluble. 
or it becomes ridiculous, depending upon 
the number of basic strings in the set. 

The interaction. effects we referred to 
above have pointed in the direction of the 
size of the vocabulary and of the com- 
plexity of grammar as two possible vari- 
ables that affect pattern conception. Та 
fact, one of the points we noted from our 
results was that a large number of differ- 
ent words reduced the difficulty of the task 
for the subject. We could not sce our way 
the question. of grammatical 


clearly on | 
Let us explore the effects of 


complexity. 
these two variables further. 
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The Number of Different Alternatives and 
Structural Similarity 
(Experiment V) 


Two variables that have been clouding 
the analysis of our data are the parameters 
of the number of different words and of 
structural similarity of the strings. We 
have equated the number of different alter- 
natives with the vocabulary of a language, 
and its grammar, roughly, with structural 
similarity. Two sets of alternatives, con- 
sisting of three words and of nine words, 
constitute the size of the vocabulary of the 
languages to be tested. The generators for 
these languages consist of three basic 
strings, five basic strings with replication 
and five basic strings without replication. 


The Random set of strings has no grammar, 
of course. 


Materials 


Two additional languages, (111,4), with vocabu- 
laries of three and nine words, respectively, were 
constructed to control similarity in the formal 
structure of the grammars. By similarity in struc- 
ture (in this context) is meant: (а) One gram- 
mar contains some information about the other 
grammars. For instance, Grammar (IILd) con- 
tains five formulas, three of which are identical 
with the three formulas of Grammar І, and with 
three of the five formulas of (III,c). (b) Struc- 
tural similarity also means that one formula 
contains some information about the other formulas. 
For example, Formulas 4 and 5 in Grammar 
(III,c) repeat Formulas 2 and 3; in (ПТа) 
they do not. This explanation of the structural 


TABLE 8 
t VALUES For A-D RATIOS AND R Scores 
A-D ratio 
аа ЕЕ 
и ||| = Ева 
ras A | B C|D|E кеш 1 J 
Random A | 
(1,3,13) в | .03 1.97 | 3.35 | 3.00 | 7.35 | .84 | 5.27 | 3.60 | 7.15 
(1,3,18) С [125 | 1.15 2.09 | 1.60 | 5.78 | .16 | 3.88 | 2.41 | 5.77 
(1,3,23) D | 2.23 | 2.76 | 1.40 56 
RE 1.53 | .90 | .96| .57 | 2.00 
19,13 E 5| ss 
(1,9,13) 3.93 | 3.43 | 1.8 15 | 2.45 | .58 | 1.69 | .10 | 2.88 
1,9,16 5 А 
(19,16); F |6.28 | 5.93 | 3.49 | 1.45 | 1.51 1.79 | .52 | 3.35 | .86 
(111,323). С |2.02| 1.55 | .04 2 | 
| || 1.42 Lt 90 | 3.60 1.51 | .66 | 2.06 
(111,9,23). H | 2.66 | 2.50 | 1.10; .36 | .57 | 1.99 | 1.09 3.25 | 1.16 
(111,3,23)4 0 | 2.35 | 7 ETS 
7 030 | 2-33 | 2.89 | 4.74 | 1.10 | 2.08 | 3.68 
(111,9,23)a 3.61 | | as | міт p m 
J .6 | 3-30 IS) 42) 31 | 094 | 1.90 | .81 | 2.85 | 


Note.—t values Гог A-D г; 
not directly comparable for 


p= 4 .05 .02 
t =1.734 2.101 2.552 


atios are entered in upper triangle. R sco alij WA RUE Hem 
ч 1 . R score t values P 5 т : 3 
different sized sets, A-D t values apply for aues are entered in left triangle, Since A-D ratios меге 


sets of equal size, 


-01 .001 


2:878 2009 af = 18 
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similarity in the grammars can be schematized 
thus: 


Grammar I Grammar (III,c) Grammar (п1,а) 


о = 
i 
1 
1 
м 
i 
i 


ным = 
' 
T 
' 
' 
0 
Lv 


©--( 
4. -- )--( 1 === (QO) - 
$&-Q-—Q- &-Q-O-O- 


Results 


In Table 8 are shown results of the # 

tests computed to evaluate the significance 
of the differences between pairs of condi- 
tions. It can be seen from this table that 
performance is significantly better with a 
vocabulary of nine words than of three 
(cf. B with E, G with Н, and I with J). 
But when the grammar changes from three 
basic strings to five, R scores remain pretty 
much the same. 
The A-D ratios rise automatically with an 
increase in the size of the set, as we noted 
earlier, so that the differences do not re- 
flect a difference in the subjects’ perform- 
ance, Time, except for (1,3) remains con- 
stant for the several conditions. 

In comparing the minimum sets of 

(1,3,13) and (19,13) with each other, we 
are impressed with the difference between 
the two conditions as evident from all three 
measures in Table 5. The reader will recall 
that these sets are identical in every respect 
except for vocabulary; the number of 
different words, then, does affect the nature 
of the strategy used. But when we turn to 
Language (1,9,13) and (1,3,23) which vary 
in two parameters—in vocabulary and in 
the number of instances per formula—we 
notice that the R score differences vanish, 
that the A-D ratios for (1,3,23) should be 
(and are) higher by virtue of the size of 
the set, and that time differences now ap- 
pear. 
The Random group and the (1,3,13) 
group arrive at similar strategies but at 
divergent speeds (cf. Columns A and B in 
Table 8). 

Table 9 compares two types of strategies 
in terms of speed of sorting. Individual 
data have been clipped for the two ex- 
tremes of the strategies rated as low and 
high. Column А is a listing of the amount 


of time spent per word by subjects who 
were rated low, that is, who scored either 
1 or 2 оп their strategies, and Column В 
presents the data for all subjects who 
scored 9 or 10. We remind the reader that 


TABLE 9 


TIME SPENT PER SYLLABLE BY INDIVIDUAL SUBJECTS 
ON STRATEGIES RATED BY THE R SCORE SCALE AS 
1 AND 2, AND 9 AND 10 


| 
A B 
Е Strategies Strategies 
Condition rated low rated high 
| (1 апа 2) (9 апа 10) 
Random | 1.61 
(0,3,16) 2.49 
3.09 
1.89 
20.09 
Median 2.79 
(1,3,13) 1.89 
3.67 
4.90 
1.57 
Median 4.285 
(1,9,13) 9.03 
10.18 
3„15 
7.56 
Median 8.245 
(111,3,23)е 2.61 6.32 
111,9,23). 5.44 
6.96 
7.31 
Median 6.96 
(111,3,23)а 8.58 
1.24 
1.54 6.19 
3.22 
Мейап | 2.38 6.19 
(111,9,23)а 4.05 
5.41 
3.13 
4.76 
14.96 
4.20 
Median | | 4.48 
Median—All 
2.55 6.19 


groups 
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strategies which have been rated low are 
based upon a counting device of some sort: 
those rated high, either parallel or are 
equal to the experimenter's grammar. 

Several things seem obvious from Table 
9. The poorer strategies go with the 
smaller vocabulary, not with the larger one 
(see Column A). In the strategies rated 
high (see Column B) there is either no 
subject, or only one subject, per condition 
under a vocabulary of three words. Also 
note that a subject spends more than twice 
as much average time per word in the better 
strategies as a subject using the less effi- 
cient ones. 


Discussion 


The Number of Different Alternatives 
(Vocabulary). The results indicate that 
the size of the vocabulary does affect pat- 
tern conception. We notice that when nine 
different words were used the subject was 
able to decode and classify information 
more readily than when only three differ- 
ent words were used. Since pattern con- 
ception is a matter of seeing meaningful 
relations among the parts of a set of mes- 
Sages, the function of the number of differ- 
ent alternatives should be to convey infor- 
mation to the decoder about these rela- 
tions by pointing up similarities and differ- 
ences among the messages. Information 
about similarity 15 usually conveyed 
through repetition, and about the difference 
by the use of a different symbol. When the 
number of alternatives is too small, the en- 
coder is forced to use the same symbol re- 
peatedly even when the messages are differ- 
ent. His problem is initially the same as 
that of an artist who has to paint a multi- 
colored landscape but who only has one, 
or maybe two, colors to paint with. When 
the number of alternatives is small, for ex- 
ample, three as in some of our languages, 
there is interference in grouping the mes- 
sages because of the exaggerated similari- 
ties among the sequences. With a larger 
number of alternatives, for example, 
larger than nine, one can again expect an 
interference in grouping the messages, but 
this time the interference will be due to 


the exaggerated differences among the 
sequences. This again will tend to obscure 
the pattern. Гегһарѕ between these two 
extremes of too few and too many alter- 
natives there is a number which serves the 
ideal function of emboldening the pattern 
for the decoder, other things being equal. 

Individual data for 3-group (any gram- 
mar group tested with three alternatives) 
show that when faced with haphazard simi- 
larities subjects resort to some kind of a 
counting device. Five subjects out of 30 
in the 3-group followed the Length 
strategy (counting) and 12 subjects in the 
same group used counting of some other 
form. Thus a total of 17 subjects out of 
30 followed some kind of a counting de- 
vice. None of the subjects in the 9-group 
used the Length strategy and only 5 sub- 
jects used some shade of the counting 
Strategy. Seventeen subjects in the 3-group 
as opposed to 5 in the 9-group is not too 
small a difference to overlook. Subjects in 
the 9-group, we suggest, can decode and re- 
code the information better, hence their 
strategies are more closely related to the 
kind of information that is contained with- 
in the set. The strategies of these subjects, 
therefore, correspond more nearly to the 
grammar of the language. 


In Grammar III we notice that although 
subjects spend the same amount of average 
time under all four conditions they do not 
employ equally good Strategies in all the 
four conditions. Strategies used by the 9- 
group are more successful than the ones 
used by the 3-group. Thus once again evi- 
dence points to the number of different 
alternatives as a critical variable. 

Now turn to Conditions B and D 
Table 7 to make an intralan 
son within С 
that there is 
vocabulary 


in 
guage compari- 
rammar I. The results suggest 
an interaction effect between 
and the number of instances 
for each recursive loop in a basic string. 
Although the number of alternatives ге- 
mains three, performance improves with an 
Mierease in the number of instances (sec 
D). With four instances for each loop, 
subjects perform almost as well аз sub- 
Jects of (19,13) ; they take more time with 


| 
| 
| 
| 
| 
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the set, however. Whether this increase in 
time for the (1,3,23) group is due to the 
larger size of the set or because it takes 
longer for the 3-group to find the pattern 
than it takes the 9-group, it is hard to say. 
Since groups (1,3,13) and (1,3,23) take 
the same amount of time for different sized 
languages, it could be argued that set size 
is less critical than the vocabulary factor. 


Structural Similiarity (Grammar). The 
other notable parameter in pattern concep- 
tion seems to be the structural similarity 
of sequences. In their studies on visual 
pattern recognition, Fitts et al. (1956) 
raised the problem of “different types and 
degrees of redundancy and of examining 
the effects of complexity of variables in- 
fluencing pattern recognition” (р. 9). 
Although the measure of redundancy does 
not apply to our materials, as pointed out 
earlier, the notion of repetitiveness does. 
By substituting the phrase structural simi- 
larity for redundancy in the quotation from 
Fitts et al. we can state our own problem 
thus: What is the effect of the type and 
degree of structural similarity on pattern 
conception? 

Similarities among sequences depend 
upon two kinds of repetitiveness: the repe- 
tition of words because of the limited 
vocabulary used and repetition due to the 
grammatical structure of the strings. In 
the previous section we discussed repetitive- 
ness based upon the number of different 
alternatives. Structural repetitiveness was 
defined in terms of the number of basic 
strings contained in each language and the 
number and placement of the loops within 
the basic formulas. A particular grammar 
selects a particular type of repetitiveness. 
The problem arising from such selection is: 
Is the pattern of similarity and difference 
more difficult in one set than in another? 

In scaling the languages for structural 
similarity, one could begin with the zero 
condition in grammatical rules, as in the 
random set of strings, where rules of for- 
mation are absent. Results show that in the 
tructural similarities among the 


absence of $ A 
peripheral 


strings а subject searches for 


similarities and he settles for the simpler 
strategies such as those of Length or of 


Initial similarity. Peripheral strategies, 
however, are not solely the function of 
nongrammatical structure. We noted 


earlier, that when the number of alterna- 
tives is small, as іп (1,3,13), that even 
when the language has a grammar, the 
grammar of the set becomes so obscure 
through word repetition that a subject 
treats the set as a random one. In such an 
instance, the strategies used for a gram- 
matical set will approximate those of the 
random set. 

When we turn to the finite state sys- 
tems, which of course are grammatical in 
structure, structural similarities consist in 
the number and kind of formulas соп- 
tained within the grammars as explained 
before. Some questions that now arise are: 
(a) When a system is enlarged by adding 
one or more parts, do we increase variance 
or its invariance or both? Thus, are three 
formulas easier to handle than five? (5) Is 
the quality rather than an increase in the 
number of parts a critical factor in strategy 
construction? Are five rules with replica- 
tion better than five rules without replica- 


tion? Моге specifically, is Grammar 
(IILc) more simple than (1,3) and 


(III,d) or is it more complex? 

The results do not give any clear indica- 
tion either as to the degree or the direc- 
tionality of the grammar effects. In the Ex- 
ploratory Study, the data indicate some im- 
provement in performance with the in- 
crease in the number of basic strings, from 
two strings to four, and to five (see Table 
3), but the time also increased correspond- 
ingly. The better controlled experiments of 
the later studies give no clear indication of 
significant change in performance with an 
increase in the number of basic strings, the 
number of states, and the number of rules 
of transition. The relative difference in the 
grammar effects based on these factors is 
hard to assess for two reasons: first, there 
are interaction effects of other variables 
with grammar; second, there is no adequate 
measure of structural similarity. Since 
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grammatical similarity is more structural 
than peripheral, it can easily elude a sub- 
ject but unless a subject takes advantage 
of grammatical repetitiveness implicit in 
the language, he fails to see structure. How 
well he will work with the grammar of the 
language will depend, in part, upon the way 
the messages have been encoded and, in 
part, upon how often they have been re- 
peated. Without a sufficient number of 
different alternatives to indicate similarities 
and differences among the parts of the sys- 
tem, and without an adequate number of 
repetitions of the messages, the grammar 
of the system will remain obscure to a sub- 
ject and its effect almost negligible upon his 
recoding behavior. On the other hand, 
when a subject does figure out the gram- 
matical principle in any one subset, it 
seems to matter less how many subsets there 
are in the entire system. Tt then becomes 
simply a matter of transfer of a principle. 
Essentially, we can argue that there is only 
one basic principle of variance and invari- 
ance involved in a finite state system. All 
the formulas—either one, or three, or five, 
or any number—are merely variations of 
this single principle of additions or dele- 
tions of symbols to a basic, invariant, se- 
quential pattern. Usually, a subject applies 
one method of classification fairly con- 
sistently to all instances in the set. Tf this 
method of classification happens to approxi- 
mate to the experimenter’s grammar, it 
seems to matter little whether three for- 
mulas have been built into the structure of 
the language or five. And if his strategy is 
different from the experimenter’s grammar, 
then it matters even less how degenerate 
the grammar is, because the subject has 
missed the point of the grammar, anyhow. 
We would like to venture another guess 
to explain the absence of a clear difference 
in the Grammar groups as measured by the 
kind of strategies subjects used. Presum- 
ably, each time one or more parts is added 
to a grammatical system, the information 
function is jointly and simultaneously 
affected by the increase in complexity and 
redundancy of the system. In other words, 
confusion and facilitation go hand jn hand 


in grammatical redundancy and these effects 
tend to cancel each other. Thus more for- 
mulas neither make the task easier nor do 
they make it more difficult. 

Although the quantitative factors of 
strings, states, and rules of transition seem 
to make little difference in performance in 
the present study, recursive loops create 
some problems for a subject. Two common 
difficulties pertaining to loops characterize 
the responses of most of our subjects. 
These difficulties we shall label as: the 
zero concept difficulty and the double-loop 
concept difficulty. 

The zero concept: It was not infrequent 
to find a subject puzzled about the basic 
string of a formula. For example, if one 
of the formulas in a grammar was a (c) 
b b a, its basic string, а (c)® b b a, was 
often held as a separate card for a con- 
siderable period of time and was even 
placed in a separate category by some sub- 
jects. This observation seems more typical 
of the double-loop formula to be discussed 
below. When a subject had decoded a 
grammatical rule as the experimenter had 
structured it, he could easily see the arith- 
metical progression in the development of 
à loop move from 1 to n, even though quite 
deliberately some exponents in the progres- 
sion had been left out. But it seemed diffi- 
cult for a subject to see that the exponent 
of the loop could also be zero. Viewed 
historically, the zero concept was not easily 
come by either. Perhaps it is difficult for 
one to think of starting а series with 
nothing. Normal counting frequently be- 
gins with 1 and moves up to n. Tt is, there- 
fore, quite possible that this way of count- 
ing interferes with the necessity to begin 


the series with the zero notation 
formulas. 


The  double-Ioop concept: The zero 
concept seemed even more difficult for sub- 
jects to handle in the double-loop for- 
mulas. This was 50 because the double- 
loop in itself was 4 hard concept for the 
subject. We have described the double- 
loop as a formula consisting of two vari- 
ables, instead of just one, within an in- 
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variant sequence of symbols. It has been 
schematized thus: a (c) b c (b) a. 

An R score of 8 or 9 signifies that sub- 
jects broke up the double-loop subset into 
three further sub-subsets as follows: 
а (c)'"bca, a b c (b)™ a, and a (c)? 
b c (b)** a. The basic string was either 
held as a separate category, or it was placed 
with one of the first two sub-subsets. In 
watching subjects at work, it was fairly 
obvious to the experimenter that subjects 
were having considerable difficulty with the 
double-loop subset. They insisted upon ap- 
plying the simple rule of one loop varying 
from 0 or 1 to » to this complex formula, 
and were confused with the similarity within 
the three sub-subsets. This can be very 
clearly seen in the thinking-aloud responses 
of our subject in the Illustrative Study. One 
could explain this difficulty on the basis of 
two characteristics of pattern response. The 
first, that the subject formulates his method 
early in the game and more or less stays 
with it; and the second, that the subject 
seeks to apply one pattern consistently 
throughout to all parts of the system. 

Reiss pointed out that a subject con- 
structs his “model” fairly early, and then 
begins to stabilize around it as he tests it 
with successive negative or positive in- 
stances. From the thinking-aloud responses 
of our subjects and on the basis of data 
from other studies on the evolution. of 
strategies not reported here, Reiss seems to 
be right. Those subjects who initiated a 
counting strategy in the beginning showed 
a strong counting bias. and those who used 
the initial strategy stuck pretty closely to 
the left-to-right rule of comparing se- 
Exceptions did occur in the Ran- 
dom group, or when the method of pres- 
entation of instances was varied. In the Ran- 
dom set, one could find the subject trying 
all sorts of strategies, and finally settling 
on a perceptual one, when nothing else 
seemed to work. But in the sets where the 
subject had once perceived the structural 
order in a subset fairly early, or even later, 
he did not give it up. This does not mean, 
of course, that there is no further refine- 
strategy with an increase in the 


quences. 


ment of the 


amount of evidence. It simply means that 
more instances only help to sharpen or 
modify the “model” already constructed. 

This kind of fixation in strategy ties in 
with the point we are making about con- 
sistency in sorting. By and large, most 
subjects work with one method of classi- 
fication for all the subsets they make. They 
do not seem to want to shift gears in their 
strategy from subset to subset. This 
characteristic way of responding is inde- 
pendent of the input, and it seems to give 
us some information about the human chan- 
nel itself. It is, therefore, important to 
note that not only does a subject organize 
the input sequences into categories, but 
that he does this on the basis of a con- 
sistent principle. 

Time 

Since time did not seem to be a simple 
function of a single variable we have post- 
poned its discussion; a few remarks are in 
order now. We note first, that the amount 
of time taken for the task is not a unique 
function of the size of the set although 
size may have something to do with it. 
Next we note that time seems to be posi- 
tively correlated with strategy. And finally 
we see that time is affected by the size of 
the vocabulary, though not without excep- 
tion. 

Time and Size of Set. Tirst, consider 
the relation between time and size of set. 
One is impressed by a lack of orderly vari- 
ation in the amount of time corresponding 
to an increase in the size of the set. The 
group total time spent on a set of 13 instances 
was 49.56 seconds for the larger and 80.36 
seconds for the smaller vocabulary. When 
the size of the set increases to 16 (larger 
vocabulary), time reduces to 39.16 seconds, 
and when the size increases to 18 instances 
(smaller vocabulary), time goes up to 67.16 
seconds. Subjects take more time with a set 
of 13 instances developed with a vocabu- 
lary of three words, than they do with a 
set of 23 instances irrespective of vocabu- 
lary size, The small number of instances 
does not seem to make the subject's task 


more manageable, but rather, it seems to 
make it more difficult, possibly by obscur- 
ing the pattern. А small set of instances 
gives him less evidence upon which to base 
his induction of the appropriate categories. 
The variables which interact with the size 
of the set apparently are far more critical 
in the time measure than the size (within 
limits) of the set itself. It is difficult to 
determine the true time function on the 
basis of adding just one more instance to 
the set. It seems one must know what that 
one more instance is, and also, what some 
of the other characteristics of the set are. 
Time and Strategy. The longer time 
measures are correlated with the more suc- 
cessful strategies. Even at the risk of ex- 
plaining the obvious, we shall state our 
point: complex strategies take more think- 
ing and thinking takes time. We define 
thinking in this situation as a search for 
relations among parts of the information 
received, in order to recode this informa- 
tion into suitable units for use and for 
storage. The human link in any communica- 
tion system is responsive not only to the 
immediate reception and use of informa- 
tion—its function is a far more complex 
one of storing this information for later 
use. As a finite system, the organism is 
confronted with an infinite amount of in- 
formation from its environment ; the quality 
of its efficiency is on trial on the basis of 
the recoding scheme the system envisages. 
How much and how well the old informa- 
tion can be reconstructed from the recoded 
units will determine the degre of efficiency 
of the thinking involved. A structural 
strategy makes the reconstruction of the 
original message far more probable than a 
simple counting strategy, for instance. It 
is, therefore, not difficult to see why it 
would also take more time. Even Donders 
(cited in Boring, 1950, pp. 147-149) was 
suggesting a hundred years ago that the 
longer reaction times were an indication of 
complex processes. We might speculate at 
this point and add that if we could force 
each subject to work with the information 
given in the set for a longer period of time 
(as we forced the subject described in the 
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Illustrative Study) by presenting the in- 
formation to him one instance at a time, 
we would expect more subjects, possibly all. 
to come closer to the original structure oi 
the grammar, despite the complexity of the 
stimulus conditions. For example, one 
subject in the (1,3,13) group spent next to 
the longest time of any and came up with a 
better solution than any of the other sub- 
jects in that group. Give a man sufficient 
time and he will find some way to reduce 
the heterogeneity of his environment to 
lawful change. 


CONCLUDING OBSERVATIONS 


Tt is clear from the data that pattern con- 
ception is a characteristic way in which 
people respond to a set of linguistic s€- 
quences, irrespective of the structure of 
the sequences. In an "open system" such 
as ours (unstructured in instruction and 
in meeting any given criteria) the subject 
must define the problem himself, construct 
some sort of working hypothesis to deal 
with the information he has extracted from 
the input, and look for evidence to see if 
his hypothesis holds. Не then discovers 
gaps in the information he has acquired, 
he searches for new evidence to fill the 
gaps, and finally he accepts his plan or 
rejects it if the gaps remain unfilled. 
Usually, though a subject may show in- 
tolerance of uncertainty, he may occasion- 
ally decide to live with his problem. The 
strategy or plan he settles on is partially a 
function of some experimental variables. 

gi was interesting to observe that remi- 
niscent of the Kantian categorical impera- 
tive, our subjects seemed to be bound by a 
categorizing imperative. And what is more, 
their categorizing seems to fall within a 
limited range. Whether there were 13 in- 
stances in а set, or 50, and whether there 
were 15 subjects tested or 150, it did not 
seem to matter; the mean number of cate- 
gories ranged between five and seven (sce 
Figure 4). By some strange coincidence. 
the integer that has been “persecuting” 
Miller (1956) for a decade, turns up once 


ом in our data. What does one make of 
it? 


= 
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Figures 4, 5, and 6 all relate to the num- 
ber of categories subjects make. The func- 
tion in Figure 4 represents data for the 
mean number of categories subjects made 
ior the different sized sets. Data show 
that the number of categories increases 
somewhat with an increase in the total num- 
ber of instances in the set, but that this in- 
crease is proportionately less. 

The average frequency of the different 
types of categories subjects make is pre- 
sented in Figure 5. By the type of category 
is meant the number of instances subjects 
put into one category; for example, there 
may be a one-instance category, а category 
with two instances in it, with three, and so 
on. The number of instances in a category 
and the total number of categories are in- 
terdependent factors, of course, Either а 
subject prefers to make a certain number 
of categories, or he prefers to put a certain 
number of instances together in one cate- 
gory. The more irequent categories are 
those with two, three, or four instances 
per category. 

Figure 6 treats the same data in a slightly 
different way. It presents individual cumu- 
lative curves for different language condi- 
tions to show the probability that a subject 
will make at least + categories. Note that 
for all conditions 50% of the subjects aver- 
age between five and six categories. 
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Я Mean number of instance categories 
per size of set. (Instance category refers to the 
number of instances placed in a category. The 
y axis represents what percentage an instance 
category is of the total number of categories made 
in each language. For example, in the 23-instance 
languages, 50 subjects made a total of 338 cate- 
gories, of which 54 categories were one-instance 
categories.) 
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We have stated above that although the 
number of instances in a set has something 
to do with the number of categories sub- 
jects make, it is not a determining factor. 
The relation of the number of basic strings 
in a set to the number of categories seems 
even less important, except when a subject 
sees the grammatical pattern accurately. 
Quite frequently subjects wished to know 
how many categories they were expected to 
make. When they were informed that the 
choice of number rested with them, they 
seemed to guide their sorting by a desire 
not to have “too many” categories. This 
was especially true of subjects employing 
some type of formula strategies. “It’s a 
shame to have a one-card category” was 
not unusual for a subject to say when an 
remained unclassified. Tf, how- 
ever, the choice before the subject was be- 
tween consistency in sorting and the num- 
ber of categories, he preferred consistency 
to number. 

Seven, then, is still a m 
“number seven, plus or minus two" de- 
notes according to Miller (1956), "the 
span of absolute judgment" and the "span 
of memory" (p. 90). We interpret the spans 
to mean that there is а finite number of 
isolated events, or of groups of events, 
within the neighborhood of seven, which 
the human link in а communication system 


instance 


agic number. The 
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Fic. 6. The probability that a subject will make at least v cate 
nine groups of subjects. (The S-shaped functions represent 41 


probability paper for each group separately. 


can best handle. It seems, then, that in 
dealing with a bulk of information the 
human channel is constrained by its “сһап- 
nel capacity," to use Miller's terminology, 
to process the given information within the 
given range of categories. It seems as 
though, because of the limits imposed by 
the memory span upon pattern conception, 
one might begin to develop a notion of con- 
ceptual span as well. Not only is memory 
limited by the number of psychological 
units it can handle, but pattern conception 
must function under the same limitation. 
Although the span of immediate memory 
is limited by the number of items, the 
amount of information the items contain 
can be increased by a process of recoding. 
Pattern conception is a description of the 
recoding process itself, The linguistic re- 
coding involved in the type of studie 
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of memory and thinking, The study is in 
its infancy; much more research is war- 
ranted. 

SUMMARY 


It is advantageous for an organism, for 
both communication and storage of infor- 
mation, to organize and symbolize the 
stimulus energies in its environment. Per- 
ceptual organization is one way to reduce 
the apparent variabilitv of the environment, 
conceptual symbolization is To- 
gether, information 
an organism with power- 
ful weapons to deal efficiently with the 
heterogeneity of the environment, Numer- 
ous psychological studies have indicated 
that perceptual recoding is limited by the 
amount of information given; is concep- 
tual recoding similarly related in any speci- 
fiable way to the given information ? 


another, 
these two forms of 


processing provide 
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Conceptual recoding is here discussed in 
terms of the discovery and formulation of 
rules to describe cognitive structures. Lan- 
guage provides one of the best illustrations ; 
verbal recoding of experience seems to be 
the essence of human thinking. It is to lan- 
guage that one can turn, therefore, to find 
the materials needed for experimental study 
of the processes of conceptual recoding. 
Natural languages, however, are tremen- 
dously complicated; some of the same 
processes can be studied more simply with 
artificial language systems. In the present 
study finite state systems were used to 
simulate natural languages. They are 
simpler, their formal structure can be 
stated precisely, yet they provide the kind of 
sequential patterning that is characteristic 
of all verbal communication. Subjects were 
given a set of linguistic strings, random or 
constrained, generated by some finite state 
systems; they were asked to group the 
strings in some appropriate way. In order 
to perform such a task all subjects employ 
some intelligible, recognizable plan for sort- 
ing the strings. The number of different 
strategies that are observed to occur is 
small and subjects are easily classified ac- 
cording to the strategy they adopt. Some 
strategies аге more complicated than 
others; they range from simple, ready-made 
rules for counting and alphabetizing to 
ad hoc inventions of rules that a subject 
considers implicit in the information he has 
been given. The former are more percep- 
tual, the latter more conceptual in spirit, 
but both are ways of describing the struc- 
tural rules of the system. 

Pattern conception depends upon the 
amount of the evidence supplied 


kind and 1 
In the present studies, 


by the experimenter. : 
the length of the messages. the number of 
instances, the number of alternatives, and 
structural similarity are some of the ex- 
perimental variables by which messages 
were varied. The results show that the 
variety of symbols (number of different 
alternatives) with which the messages are 
constructed is far more critical іп con- 
trolling pattern conception than is the 


length or the structure of the messages. 


Two sets of messages, identical in length, in 
rules of formation, and in the instruction 
given subjects, but differing in the number 
of symbols, elicit qualitatively different re- 
sponses. A large variety of symbols tends 
to elicit the invented strategies whereas the 
small variety generally leads to ready- 
made rules. This result indicates that the 
subject is able to recode the given informa- 
tion more effectively when there is a variety 
in the symbols from which it is composed. 

А parallel advantage is observed by in- 
creasing the amount of evidence provided 
to a subject for a given rule. The number 
of instances per rule was varied from two 
to three to four instances, while other fac- 
tors of vocabulary, grammar, and instruc- 
tions were held constant. The strategies 
that subjects used under each of these con- 
ditions were evaluated in terms of two sta- 
tistical techniques developed specifically for 
the purpose of strategy analysis. As the 
amount of evidence increased, the strate- 
gies improved. A larger quantity of evi- 
dence facilitates recoding of the sequences 
in terms of variant and invariant symbols. 

Earlier experiments on perception have 
indicated that redundancy may be of several 
kinds, and that the influence of redundancy 
upon responses varies with its quality. 
Although we have not applied an informa- 
tion measure of redundancy to our ma- 
terials (the finite state systems that we 
used are too simple to require such statisti- 
cal treatment) we have worked with a 
related conception of structural similarity 
(rules of formation) in linguistic se- 
quences. Languages varying in number and 
complexity of rules of formation, but 
similar in length and size of vocabulary. 
do not yield markedly different results. The 
fewer rules embedded in a language do not 
necessarily make that language simpler. 
Data indicate that structural redundancy is 
less critical in pattern conception than is 
the variety in symbols or the amount of 
evidence. More experiments with better 
controls on redundancy are needed to de- 
termine the effect of this variable оп pat- 


tern conception. 
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Two specific difficulties were found to 
characterize the responses of many subjects 
in almost all of the experimental groups. 
These difficulties were the concept of zero 
and the concept of double variation in a 
linguistic sequence of variant and invariant 
symbols. Subjects could delete or add the 
variable from 1 to и fairly easily, but they 
struggled with the string when the exponent 
of the variable was equal to zero. In the 
case of more than one variable in a string 
subjects tended to group its instances into 
as many subsets as there were variables in 
the string. This multiple-loop difficulty is 
not unrelated to the zero difficulty, how- 
ever. 

Pattern conception, apparently, is not a 
simple function of any one experimental 
variable. This point was most clearly evi- 
dent from the time measures, which did not 


correlate consistently with any one experi- 
mental condition. It was reasonably clear, 
however, that the longer time measures cor- 
related positively with the better strategies. 
Presumably, the central processes of re- 
coding and thinking take time, which causes 
the delay in response. There was little evi- 
dence for an increase in time with the in- 
creasing length of the set. 

Finally, all studies led us to conclude that, 
irrespective of the type of finite state sys- 
tem used—its structure, vocabulary, length, 
or anything else—the average number of 
categories that subjects make is five or six. 
Although we were not looking for this 
point, this result is in keeping with the 
findings from some studies on “channel 
capacity,” and it indicates that thinking like 
memory is centrally affected by the channel 
capacity of the organism. 


7 
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APPENDIX А 


AGREEMENT-DISAGREEMENT RATIO 


The Agreement-Disagreement ratio is a sta- 
tistic devised to measure the amount of correla- 
tion between sortings made by any two persons. 
The technique of calculating the amount of 
agreement between different sortings is as 
follows: 


Consider the sortings made by any two sub- 
jects X and Y: 


v—( 2 4.5) (3) (6 10 12 15) 


(78911 13 14) (16) 


Step 1. Construct sets from the intersection 
of these two classifications : 


Intersection (1 24 5) (6 12) (7 8 11 13) 
(9 14) (10 15) (3) (16) 


The intersection is constructed in such a way 
that every set in the data for X and Y can be 
reconstructed by combining sets from the inter- 
section, and yet the sets in the intersection are 
as large as possible. Otherwise said, every 
distinction among the strings that either Х or 
Y indicated is preserved in the intersection. 


Step 2. Calculate the number of agreements 
for each set. For example, if а set contains 
four strings there are 3 + 2 + 1 = 6 pairs of 
strings that agree with one another in the 
sense that they are placed in the same class 


and considered to be equivalent. Below are 


tabulated the number of pairs of strings that 
agree as a function of the size of set. 


size number agreements 
1 0 
2 1 
3 3 
4 6 
5 10 
x r(r—1)/2 


Thus, for example, a set of 11 strings would 
yield 55 pairs that were in agreement. If size 
is x then the number of agreements is given by 
the equation А = x(x—1)/2. So the second 
step is to add up the number of agreements in 
this sense for X, Y, and the intersection: 


subject X subject Y intersection 
5 10 4 6 4 6 
x] 3 1 0 0 
2 1 4 6 2 1 
6 15 6 15 + 6 
29 1 0 2 1 
27 2 sU 
1 0 
15 


Step 3. Calculate {һе A-D ratio from these 
three numbers, 20, 27, 15, by substituting them 


: into the following equation: 


X + y — 2 Intersection 
60 


т = 1 — 29-4 27 —2 (15) 26 


т=1— 


=1— 443 = 567 
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APPENDIX B 


RATING SCALE For SUBJECTS’ STRATEGIES 
< 
The second column gives the average scores for 
the ratings assigned to 20 strategies by five judges. 
The first column rounds off the average scores for 
purposes of establishing a 10-point rating scale. 


Final Rating Average 


Score Score* Kind of Strategy 
1 1.0 General Size 
1.5 Length 
2 2.4 Initial, or Middle, or 
Final; 
253 Alphabet 
3 2.7 One striking word with 


Initial or Final one 

3.1 Initiale ог пу, Middle or n), 
Finale ог п) 

3.1 Initial;-Final, 

3.4 Length plus Alphabet 

3.4 Initial; plus Alphabet or 
Initial plus Length 


3.6 Rhythm 
4 3.8 Mixed methods for entire 
set 
3.8 Mixed methods by groups 
4.3 Alternating word or com- 
plex counting 
5 5.3 Symmetry and Rhythm 
5.6 Initial plus complex sub- 
stitution or ordering 
6 6.2 Initialise or »-Finale ог 3) 
7 7.4 Maximum Identity 
8 8.2 Initial-Middle-Final 
9 9.0 Less than basic string 


(breaking up the for- 
mula by loops) 


10 10.0 Formula (same as experi- 
TM. menter's grammar) 


* The steps between scores are not necessarily equal, 
Р . 
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